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Abstract. A model for a subject's beliefs about a phenomenon may exhibit symmetry, 
in the sense that it is invariant under certain transformations. On the other hand, such a 
belief model may be intended to represent that the subject believes or knows that the phe- 
nomenon under study exhibits symmetry. We defend the view that these are fundamentally 
different things, even though the difference cannot be captured by Bayesian belief mod- 
els. In fact, the failure to distinguish between both situations leads to Laplace's so-called 
Principle of Insufficient Reason, which has been criticised extensively in the hterature. 

We show that there are belief models (imprecise probability models, coherent lower 
previsions) that generalise and include the Bayesian belief models, but where this fun- 
damental difference can be captured. This leads to two notions of symmetry for such 
belief models: weak invariance (representing symmetry of beliefs) and strong invariance 
(modelling beliefs of symmetry). We discuss various mathematical as well as more philo- 
sophical aspects of these notions. We also discuss a few examples to show the relevance of 
our findings both to probabilistic modelling and to statistical inference, and to the notion 
of exchangeability in particular. 



1. Introduction 

This paper deals with symmetry in relation to models of beliefs. Consider a model for 
a subject's beliefs about a certain phenomenon. Such a belief model may be symmetrical, 
in the sense that it is invariant under certain transformations. On the other hand, a belief 
model may try to capture that the subject believes that the phenomenon under study exhibits 
symmetry, and we then say that the belief model models symmetry. We defend the view 
that there is an important conceptual difference between the two cases: symmetry of beliefs 
should not be confused with beliefs of symmetryO 

Does this view need defending at all? That there is a difference may strike you as 
obvious, and yet we shall argue that Bayesian belief models, which are certainly the most 
popular beUef models in the literature, are unable to capture this difference. 

To make this clearer, consider a simple example. Suppose I will toss a coin, and you 
are ignorant about its relevant properties: it might be fair but on the other hand it might 
be heavily loaded, or it might even have two heads, or two tails (situation A). To you 
the outcomes of the toss that are practically possible are h (for heads) and t (for tails). 
Since you are ignorant about the properties of the coin, any model for your beliefs should 
not change if heads and tails are permuted, so the model that 'faithfully' captures your 
beliefs about the outcome of the toss should be symmetrical too, i.e., invariant under this 
permutation of heads and tails. 
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'This echoes Iwallevl 's Il99ll Section 9.5.6, p. 466] view that 'symmetry of evidence' is not the same thing as 
'evidence of symmetry'. 
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Suppose on the other hand that you know that the coin (and the tossing mechanism) I 
shall use is completely symmetrical (situation B). Your belief model about the outcome 
of the toss should capture this knowledge, i.e., it should model your beliefs about the 
symmetry of the coin. 

Our point is that belief models should be able to catch the important difference between 
your beliefs in the two situations. Bayesian belief models cannot do this. Indeed — the 
argument is well-known — the only symmetrical probability model, which is in other words 
invariant under permutations of heads and tails, assigns equal probability 1/2 to heads and 
tails. But this is automatically also the model that captures your beliefs that the coin is 
actually symmetrical, so heads and tails should be equally likely. 

The real reason why Bayesian belief models cannot capture the difference between 
symmetry of models and modelling symmetry, is that they do not allow for indecision. 
Suppose that I ask you to express your preferences between two gambles, whose reward 
depends on the outcome of the toss. For first one, a, you will win one euro if the outcome 
is heads, and lose one if it is tails. The second one, b, gives the same rewards, but with 
heads and tails swapped. 

In situation B, because you believe the coin to be symmetrical, it does not matter to you 
which gamble you get, and you are indifferent in your choice between the two. 

But in situation A, on the other hand, because you are completely ignorant about the 
coin, the available information gives you no reason to (strictly) prefer a over b or b over a. 
You are therefore undecided about which of the two gambles to choose. 

Because decision based on Bayesian belief models leaves you no alternative but to either 
strictly prefer one action over the other, or to be indifferent between them, the symmetry 
of the model leaves you no choice but to act as if you were indifferent between a and b. 
We strongly believe that it is wrong to confuse indecision with indifference in this example 
(and elsewhere of course), but Bayesian belief models leave you no choice but to do so, 
unless you want to let go of the principle that if your evidence or your beliefs are sym- 
metrical, your belief model should be symmetrical as well. The problem with Laplace's 
Principle of Insufficient Reason is precisely this: if you use a Bayesian probability model 
then the symmetry present in ignorance forces you to treat indecision (or insufficient rea- 
son to decide) between a and b as if it were indifference^ Or in other words, it forces you 
to treat symmetry of beliefs as if there were beliefs of symmetry. 

If on the other hand, we consider belief models that allow for indecision, we can sever 
the unholy link between indecision and indifference, because in a state of complete igno- 
rance, we are then allowed to remain undecided about which of the two actions to choose: 
in the language of preference relations, they simply become incomparable, and you need 
not be indifferent between them. As we shall see further on, similar arguments show that 
such belief models also allow us to distinguish between 'symmetry of models' and 'mod- 
els of symmetry' in those more general situations where the symmetry involved is not 
necessarily that which goes along with complete ignorance. 

So, it appears that in order to better understand the interplay between modelling beliefs 
and issues of symmetry, which is the main aim of this paper, we shall need to work with 
a language, or indeed, with a type of belief models that, unlike the Bayesian ones, take 



This may seem a good explanation whv lKevneslll92lL p. 83] renamed the 'Principle of Insufficient Reason' 
the 'Principle of Indifference'. He (and others. see lZabelll Il989bll ) also suggested that the principle should not 
be applied in a state of complete ignorance, but only if there is good reason to justify the indifference (such as 
when there is evidence of symmetry). By the way, Keynes was also among the first to consider what we shall call 
imprecise probability models, as his comparative probability relations were not required to be complete. 
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indecision seriousl y. For this purp ose, we shall use the language of the so-called imprecise 



probability models llWalleyLll991ll . and in particular coherent lower previsions, which have 



the same behavioural pedigree as the more c ommon Bayesian belief models (in casu co- 



herent previsions, see ,de Finettil 111974-197511 ). and which contain these models as a special 



case. We give a somewhat unusual introduction to such models in Section|20 In Section[3] 
we provide the necessary mathematical background for discussing symmetry: we discuss 
monoids of transformations, and invariance under such monoids. After these introductory 
sections, we start addressing the issue of symmetry in relation to belief models in SectionS) 
We introduce two notions of invariance for the imprecise probability models introduced in 
Section|2] weak invariance, which captures symmetry of belief models, and strong invari- 
ance, which captures that a model represents the belief that there is symmetry. We study 
relevant mathematical properties of these invariance notions, and argue that the distinction 
between them is very relevant when dealing with symmetry in general, and in particular 
(Section|5]l for modelling complete ignorance. Further interesting properties of weak and 
strong invariance, related to inference, are the subject of Sections|6]and|2l respectively. We 
show among other things that a weakly invariant coherent lower prevision can always be 
extended to a larger domain, in a way that is as conservative as possible. This implies that, 
for any given monoid of transformations, there always are weakly invariant coherent lower 
previsions. This is not generally the case for strong invariance, however, and we give and 
discuss sufficient conditions such that for a given monoid of transformations, there would 
be strongly invariant coherent (lower) previsions. We also give various expression for the 
smallest strongly invariant coherent lower prevision that dominates a given weakly invari- 
ant one (if it exists). In Section [8] we turn to the important example of coherent (lower) 
previsions on the set of natural numbers, that are shift-invariant, and we use them to charac- 
terise the strongly invariant coherent (lower) previsions on a general space provided with a 
single transformation. Further examples are discussed in Section|9l where we characterise 
weak and strong invariance with respect to finite groups of permutations. In par ticular. 



we discuss WaHex's [19911 generalisation to lower previsions of de Finetti's 11193711 notion 
of exchangeabihty, and we use our characterisation of strong permutation invariance to 
prove a generalisation to lower previsions of de Finetti's representation results for finite 
sequences of exchangeable random variables. Conclusions are gathered in SectionfTOl 

We want to make it clear at this point that this paper owes a significant intellectual 
debt to Peter Walley. First of all, we use his behavioural imprecise probability models 
iWalley, 1991] to try and clarify the distinction between symmetry of beliefs and beliefs 
of symmetry. Moreover, although we like to believe that much of what we do here is new, 
we are also aware that in many cases we take to their logical conclusion a number of ideas 
about symmetry that are clearly present in h is work (mainly Walley [1991, Sections 3.5, 
9.4 and 9.5] and IPericchi and Walleyl 1 1991 1). sometimes in embryonic form, and often 
more fully worked out. 

2. Imprecise probability models 

Consider a very general situation in which uncertainty occurs: a subject is uncertain 
about the value that a variable X assumes in a set of possible values Because the 
subject is uncertain, we shall call X an uncertain, or random, variable. 



For other brief and perhap s more conventional introduction s to the topic, we refer to lwaiie^l ll996iJl . 
iDe Cooman and ZaffaloJ 120041 ■ loe Cooman and Troffaej l2004ll . lOe Cooman and M iranda' f2006^. A much 
more detailed account of the behavioural theory of imprecise probabilities can be found in Walley 1 19911 . 
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The central concept we shall use in order to model our subject's uncertainty about X, is 
that of a gamble (on X, or on which is a bounded real-valued function / on . In 
other words, a gamble / is a map from ^ to the set of real numbers M such that 

sup / := sup{/(;c) : x G ^} and inf/ := \nf{f{x) : x G ^} 

are (finite) real numbers. It is interpreted as the reward function for a transaction which 
may yield a different (and possibly negative) reward f{x), measured in units (called utiles) 
of a pre-determined linear utilityjj for each of the different values x that the random variable 
X may assume in JF. 

We denote the set of all gambles on X by ^(^). For any two gambles / and g, we 
denote their point- wise sum by / + and we denote the point- wise (scalar) multiplica- 
tion of / with a real number X by A/. ^(^) is a real linear space under these opera- 
tions. We shall always endow this space with the supremum norm, i.e., ||/|| = sup|/| = 
sup{|/(x)| : X e J^}, or equivalently, with the topology of uniform convergence, which 
turns ^(c^ ) into a Banach space. 

An event A is a subset of If X S A then we say that the event occurs, and ifX^A 
then we say that A doesn't occur, or equivalently, that the complement(ary event) A"" = 
{x e ^ : X ^ A} occurs. We shall identify an event with a special {0, 1 }-valued gamble 
called its indicator, and defined by /^(x) = 1 if x G A and /a(x) = elsewhere. We shall 
often write A for I a, whenever there is no possibility of confusion. 

2. 1 . Coherent sets of really desirable gambles. Given the information that the subject 
has about X, she will be disposed to accept certain gambles, and to reject others. The idea 
is that we model a subject's beliefs about X by looking at which gambles she accepts, and 
to collect these into a set of really desirable gambles 3^. 

The dice example. Assume that our subject is uncertain about the outcome X of my tossing 
a die. In this case ^ = J?^ := {1,2,3,4,5,6} is the set of possible values for X. If the 
subject is rational, she will accept the gamble which yields a positive reward whatever the 
value of X, because she is certain to improve her 'fortune' by doing so. On the other hand, 
she will not accept a non-positive gamble that is negative somewhere, because by accepting 
such a gamble she can only lose utility (we then say she incurs a partial loss). She will 
not accept the gamble which makes her win one utile if the outcome X is 1, and makes her 
lose five utiles otherwise, unless she knows for instance that the die is loaded very heavily 
in such a way that the outcome 1 is almost certain to come up. 

Real desirability can also be interpreted in terms of the betting behaviour of our subject. 
Suppose she wants to bet on the occurrence of some event, such as my throwing 1 (so that 
she receives 1 utile if the event happens and utiles otherwise). If she thinks that the die 
is fair, she should be disposed to bet on this event at any rate r strictly smaller than g. This 
means that the gamble — r representing this transaction (winning 1 — r if the outcome 
of X is 1 and losing r otherwise) will be really desirable to her for r < ^ . ♦ 

Now, accepting certain gambles has certain consequences, and has certain implications 
for accepting other gambles, and if our subject is rational, which we shall assume her to 
be, she should take these consequences and implications into account. To give but one 
example, if our subject accepts a certain gamble / she should also accept any other gamble 

This utility can be regarded as amounts of money, as is the case for instance in Ide Finettil Il974^1975h . It 
is perhaps more realistic, in the sense that the linearity of the scale is better justified, to interpre t it in terms of 
probability currency: we win or lose lottery tickets depending on the outcome of the gamble; see IWalle^^ ll99lL 
Section 2.2]. 
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g such that g > f, i.e., such that g point-wise dominates /, because accepting g is certain 
to bring her a reward that is at least as high as accepting / does. 

Actually, this requirement is a consequence [combine (D2) with (D3)] of the follow- 
ing four basic rationality axioms for real desirability, which we shall assume any rational 
subject's set of really desirable gambles ^ to satisfy: 
(Dl) if / < then f [avoiding partial loss]; 
(D2) if / > then / G ^ [accepting sure gains]; 

(D3) if / G ^ and g^Si then f + g^Si [accepting combined gambles] 
(D4) if / G ^ and A > then If e^. [scale invariance]. 

where / < g is shorthand for f < g and / ^ g^ We call any subset ^ of ^(^') that 
satisfies these axioms a coherent set of really desirable gambles. 

It is easy to see that these axioms reflect the behavioural rationality of our subject: (Dl) 
means that she should not be disposed to accept a gamble which makes her lose utiles, no 
matter the outcome; (D2) means that she should accept a gamble which never makes her 
lose utiles; on the other hand, if she is disposed to accept two gambles / and g, she should 
also accept the combination of the two gambles, which leads to a reward f + g; this is an 
immediate consequence of the linearity of the utility scale. This justifies (D3). And finally, 
if she is disposed to accept a gamble /, she should be disposed to accept the scaled gamble 
Xf for any A > 0, because this just reflects a change in the linear utility scale. This is the 
idea behind condition^ (D4). 

IWallevlil99lll2000ll has a further coherence axiom that sets of really desirable gambles 
should satisfy, which turns out to be quite important for conditioning, namely 

(D5) if ^ is a partition of and if Ib/ G ^ for aU B in ^, then / G ^ [full conglom- 
erability] . 

Since this axiom is automatically satisfied whenever ^ is finite [it is then an immediate 
consequence of (D3)], and since we shall not be concerned with conditioning unless when 
^ is finite (see Section|9]l, we shall ignore this additional axiom in the present discussion. 

A coherent set of really desirable gambles is a convex cone [axioms (D3)-(D4)] that 
includes the 'non-negative orthant' := {/ G ^{^) : / > 0} [axiom (D2)] and has no 
gamble in common with the 'negative orthant' := {/ G ^( JT) : / < 0} [axiom (D1)]0 
If we have two coherent sets of really desirable gambles and such that ^1 C 
then we say that ^1 is less committal, or more conservative, than because a subject 
whose set of really desirable gambles is ^2 accepts at least all the gambles in ^1 . The 
least-committal (most conservative, smallest) coherent set of really desirable gambles is 
Within this theory, it seems to be the appropriate model for complete ignorance: if our 
subject has no information at all about the value of X, she should be disposed to accept only 
those gambles which cannot lead to a loss of utiles (see also the discussion in Section|5]l. 

Now suppose that our subject has specified a set ^ of gambles that she accepts. In an 
elicitation procedure, for instance, this would typically be a finite set of gambles, so we 
cannot expect this set to be coherent. We are then faced with the problem of enlarging this 
^ to a coherent set of really desirable gambles that is as small as possible: we want to 
find out what are the (behavioural) consequences of the subject's accepting the gambles 
in taking into account only the requirements of coherence. This inference problem is 

^So, here and in what follows, we shall write '/ < 0' to mean '/ < and not / = 0', and '/ > 0' to mean 
7>0andnot/ = 0'. 

^This means that the zero gamble belongs to the set of really desirable gambles. This is more a mathematical 
convention than a behavioural requirement, since this gamble has no effect whatsoever in the amount of utiles of 
our subject. See more details in iWallev. L1991i1 . 
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(also formally) similar to the problem of inference (logical closure) in classical proposi- 
tional logic, where we want to find out what are the consequences of accepting certain 
propositions]] 

The smallest convex cone including and or in other words, the smallest subset 
of ££{3^) that includes Si and satisfies (D2)-(D4), is given by 

|g e ^{SC) : g > Y^Xi^ff, for some « > 0, G M+ and A G ^| , 

where IR+ denotes the set of non-negative real numbers. If this convex cone intersects 
then it is easy to see that actually Sl^ — and then it is impossible to extend Si, 

to a coherent set of really desirable gambles [because (Dl) cannot be satisfied]. Observe 
that <r|, n "ifi = if and only if 

n 

there ai-e no n > 0, A<; £ M+ and £ ^ such that '^kfk < 0, 

k=l 

and we then say that the set S^ avoids partial loss. Let us interpret this condition. As- 
sume that it doesn't hold (so we say that ^ incurs partial loss). Then there are really 
desirable gambles /i, ...,/„ and positive Aj, . . . , A„ such that '£!k=i ^kfk < 0. But if our 
subject is disposed to accept the gamble f/^ then by coherence [axioms(D2) and (D4)] she 
should also be disposed to accept the gamble X^ft for all > 0. Similarly, by coherence 
[axiom (D3)] she should also be disposed to accept the sum YJk=i ^kfk- Since this sum is 
non-positive, and strictly negative in at least some elements of we see that the subject 
can be made subject to a partial loss, by suitably combining gambles which she accepts. 
This is unreasonable. 

When the class ^ avoids partial loss, and only then, we are able to extend ^ to a 
coherent set of really desirable gambles, and the smallest such set is precisely ^fj, which is 
called the natural extension of ^ to a set of really desirable gambles. This set reflects only 
the behavioural consequences of the assessments present in Si: the acceptance of a gamble 
/ not in S'!^ (or, equivalently, a set of really desirable gambles strictly including <f^) is not 
implied by the information present in Sf,, and therefore represents stronger implications 
that those of coherence alone. 

2.2. Coherent sets of almost-desirable gambles. Coherent sets of really desirable gam- 
bles constitut e a very general and powerful c lass of models for a subject's beliefs (see 
WaUevI II1991I Appendix F] and lWallevI ll200(]ll for more details and discussion) We could 



already discuss symmetry aspects for such coherent sets of really desirable gambles, but 
we shall instead concentrate on a slightly less general and powerful type of belief models, 
namely coherent lower and upper previsions. Our main reason for doing so is that this will 
allow us to make a more direct comparison to the more familiar Bayesian belief models, 
and in particular to de Finetti's [1974-1975J coherent previsions, or fair prices. 

Consider a gamble /. Then our subject's lower prevision, or supremum acceptable 
buying price, P_{f ) for / is defined as the largest real number s such that she accepts the 
gamble f ~t for any price f < i, or in other words accepts to buy / for any such price t. 
Similarly, her upper prevision, or infimum acceptable selling price, P{f) for the gamble / 
is the smallest real number s such that she accepts the gamble f — / for any price f > s, or 
in other words accepts to sell / for any such price t. 



^See lMoral and Wilsoij Il995ll and lOe Cooing] I2OOOI l2005ll for more details on this connection between 
natural extension and inference in classical propositional logic. 
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For an event A, the lower prevision P{Ia) of its indicator is also called the lower prob- 
ability of A, and denoted by P{A). It can be interpreted as the supremum rate for betting 
on the event A. Similarly, P{Ia) is called the upper probability of A, and also denoted by 
P{A). 

Since selling a gamble / for price s is the same thing as buying — / for price ~s, we 
have the following conjugacy relationship between an upper and a lower prevision: 

P{f) = -Pi-f)- 

This implies that from a given lower prevision P, we can always construct the conjugate 
upper prevision P, so they are mathematically equivalent belief models. In what follows, 
we shall mainly concentrate on lower previsions. 

Now assume that our subject has a coherent set of really desirable gambles then it is 
clear from the definition of lower and upper prevision that we can use ^ to define a lower 
prevision 

P^ if) = sup {s eR: f~se M} (D-LPR) 

and an upper prevision 

P,^{f)^mf{seR: s-fem 
for every gamble / on So, given ^ we can construct two real-valued functionals, 
Pj^ and P_^, whose interpretation is that of a supremum acceptable buying price, and an 
infimum acceptable selling price, respectively, and whose domain is ^{^). We shall call 
these functionals lower and upper previsions. 

We call a coherent lower prevision on ^(^') any real-valued functional on ^(^) 
satisfying the following three axioms: 
(PI) P_{f) > inf/ [accepting sure gains]; 
(P2) £(/ + §)>£(/)+ [super-additivity]; 
(P3) P{Xf) — XP{f) [non-negative homogeneity], 
for all gambles / and g on ,!^, and all non-negative real X. 

It follows from the coherence axioms (D1)-(D4) for ^ that the lowerprevision that 
corresponds to a coherent set of really desirable gambles ^ is coherentjj 

So we see that with a coherent set of really desirable gambles we can define a coher- 
ent lower prevision on ^(^), using (ID-LPRI) . We shall see further on that, conversely, 
given a coherent lower prevision P on ^{^), we can always find a coherent set of really 
desirable gambles 1% such that £ and M are related through dP-LPRI l. But unfortunately, 
the relationship between the two types of belief models is many-to-one: there are usually 
many coherent sets of really desirable gambles that lead to the same coherent lower previ- 
sion. This is why we said before that coherent sets of really desirable gambles are a more 
general and powerful belief model than coherent lower previsions. The ultimate reason for 
this is the following: suppose that a subject specifies her supremum buying price P{f) for 
a gamble /. This implies that she accepts all the gambles / — P{f) + d, where 5 > 0. But 
the specification of P{f) says nothing about the gamble / — P{f) (where 5 = 0) itself: she 
might accept it, but then again she might not. And precisely because specifying a coherent 
lower prevision says nothing about this border behaviour, it leads to a belief model that is 
less powerful than coherent sets of really desirable gambles, where this border behaviour 
would be determined. 

The dice example (cont). Let us go back to the die example. Consider, for any x in ^ = 
{!,..., 6}, the event {x} that the outcome X of rolling the die is x. If, for some real 



'To prove (PI), use (D2); for {P2) use {D3); and for (P3) use (D4) for A > and (Dl) and (D2) for A = 0. 
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number r, our subject accepts the gamble /{^} — r, she is wilHng to pay r utiles in return 
for the uncertain reward /{jj, or in other words to bet on the event {1} at rate r. So her 
lower probability P_{{x}) for {x}, or equivalently, her lower prevision Pil[x}) for /{^}, is 
the supremum rate at which she is willing to bet on {x}. This means that she accepts 
the gamble /j^j — s for any s < P{{x}). But it doesn't imply that she actually accepts the 
gamble /{^j —P{{x}): this gamble is only claimed to be almost-desirable, as we shall see 
further on. 

If she is completely ignorant about the properties of the die, her evidence about the 
die is symmetrical, i.e., doesn't change when the possible outcomes are permuted. A 
belief model that 'faithfully' captures the available evidence should therefore be symmet- 
rical with respect to such permutations as well, so we infer that in particular £({!}), . . . , 
P({6}) are all equal to some number p. Coherence [use (PI) and (P2)] then requires that 
< p < 5. Any such p leads to a symmetrical lower probability defined on the singletons, 
and therefore reflects 'symmetry of beliefs'. As we have indicated above, the model corre- 
sponding to /:> = is the one that reflects complete ignorance. We shall see further on (see 
Sections [4.21 and |9]l that the choice p — \ leads to the only model that captures the belief 
that the die is fair, i.e., that reflects 'beliefs of symmetry'. ♦ 

In order to better understand the relationship between coherent lower previsions and 
coherent sets of really desirable gambles, we need to introduce, besides real desirability, an 
new and weaker notion, called almost-desirability, which will also play an important part in 
our discussion of symmetry further on. This notion is inspired by the ideas in the discussion 
above: we say that a gamble / is almost-desirable to a subject, or that she almost-accepts 
f, whenever she accepts / + 5, or in other words / + 5 is really desirable to her, for any 
strictly positive amount of utility 5 > 0. By stating that / is almost-desirable to her, nothing 
is specified about whether the subject accepts / itself: she might, but then again she also 
might not. If we generically denote by a set of gambles that are almost-desirable to our 
subject, we see that the set of almost-desirable gambles that corresponds to a coherent 
set ^ of really desirable gambles, is given by 

^<? = {/eif(jr): (V5>o)/ + 5e^}- (D-M) 

5>0 

SO is the closure (in the topology of uniform convergence on ^(^)) of the convex 
cone ^. 

We call any set of gambles ^ that satisfies the following five axioms a coherent set of 
almost-desirable gambles: 

(Ml) if sup/ < then f ^'3 [avoiding sure loss]; 

(M2) if inf / > then f & !^ [accepting sure gains]; 

(M3) if / £ and g £ Si then / + g G S [accepting combined gambles]; 

(M4) if / e ^ and A > then A/ e & [scale invai'iance]; 

(M5) if / + 5 e ^ for all 5 > then / e ^ [closure]. 

It is a closed and convex cone in ^{^) that includes the non-negative orthant and 
does not intersect with the set = {f € ^{^) : sup/ < 0} C It is easy to see 
that the set of almost-desirable gambles Scg that corresponds to a coherent set of really 
desirable gambles ^ is actually also coherent^ 



prove (Ml), use (Dl) with S = -2^^; to prove (M2), use (D2); to prove (M3), use (D3); to prove (M4), 
use (D4); and to prove {M5), use e = | and the definition of to prove that / + 5 € for all 5 > 0. 
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It should at this point come as no surprise that coherent lower previsions and coherent 
sets of almost-desirable gambles are actually equivalent beUef models. Indeed, consider a 
coherent set of almost-desirable gambles '3, i.e., satisfies (M1)-(M5). Then the real- 
valued functional defined on (^) b>[3 

max{5 e M: /- i G ^} (M-LPR) 

satisfies (P1)-(P3) and therefore is a coherent lower prevision on 

Conversely, if we consider a coherent lower prevision P_ on ^(^), i.e., P_ satisfies 
(P1HP3), then the set of gambles 

%:={/e^(^): £(/) >0} (LPR-M) 

satisfies (M1)-(M5) and is therefore a coherent set of abnost-desirable gambles0 More- 
over, the relationships (IM-LPRl i and (ILPR-Ml i are bijective (one-to-one and onto), and they 
are each other's inverses 

Finally, consider a coherent lower prevision P_ on ^(^), and define the following set 
of gambles 

:= {/ (^^{.T): P{f) > or / > 0} . 

Then S>p U {0} is a coherent set of really desirable gambles, i.e., it satisfies (D1)-(D4)Q 
Moreover, any coherent set of really desirable gambles ^ that satisfies 

^£ U {0} C c %, 

i.e., the union of whose (relative) topological interior with is H^p U {0} and whose 
topological closure is has P as its associated lower prevision, through (ID-LPRl l. This 
confirms what we claimed before: coherent lower previsions, or equivalently, coherent sets 
of almost-desirable gambles, are less powerful belief models than coherent sets of really 
desirable gambles. If a subject specifies a coherent lower prevision P, then she actually 
states that all gambles in the union U {0} of with the relative topological interior 
of !3p are really desirable, but she doesn't specify whether the gambles in the topological 
boundary \ S^p of !Sp_ are: we only know that they are almost-desirable to her 

2.3. Natural extension for coherent lower previsions. There is one important problem 
that we skipped over in the discussion above, namely that of inference. Suppose a subject 
specifies a set 2) of gambles that are almost-desirable to her. In an elicitation procedure, 
for instance, this would typically be a finite set of gambles, so we cannot expect this set 
to be coherent. We are then, as before for really desirable gambles, faced with the prob- 
lem of enlarging this & into a coherent set of almost-desirable gambles that is as small 
as possible: we want to find out what are the (behavioural) consequences of the subject's 
almost-accepting the gambles in taking into account only the requirements of coher- 
ence. 

^'^The supremum in Eq. )D-LPR) now becomes a maximum, simply because the set ® is closed. 
' '(PI) follows from (M2), (P2) from (M3) and (P3) is a consequence of (M4). 

'^First, conditions (PI) and (P2) imply that P is monotone. Now, (P2) and (P3) imply that = P(0) > 
P{f) + P[-f) > + inf(-/). whence P(/) < sup/. From these two facts we deduce (Ml). (M2) is a 
consequence of (PI), (M3) of (P2) and (M4) of (P3). Finally, the monotonicity of P implies that P{ll) = fl for 
any constant value fl, and from this we deduce that P{f + 5) = £(/) + 5 for any 5 > 0. This implies (M5). 

'^^To see that they are each other inverses, it suffices to use that a coherent lower prevision satisfies P{f — s] = 
P{f) — s for any gamble / and any real number .v, and, conversely, that / G S>p_ if and only if P{f) > 0; this 
implies also that both transformations are bijective. 

'^For (Dl), use that a coherent lower prevision P satisfies £(/) < sup/ for any gamble /; for (D2), that / > 
satisfies either / > or / = 0; for (D3), use (P2) and the monotonicity of the coherent P, and for (D4) use (P3). 
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The smallest closed convex cone including and or in other words, the smallest 
subset of ^{^) that includes ^ and satisfies (M2)-(M5), is given by 



6^ ■ = 



^e^(jr): (V5>o)(3«>o,Aie: 



Jke&)g>'£Xkfk-5\. (M-NE) 

k=i 



This is the topological closure of the set (a^. If this convex cone S"^ intersects — 
{/ e if ( JT) : sup/ < 0} then it is easy to see that actually S"^ = S£{X), and then it is 
impossible to extend !^toa coherent set of almost-desirable gambles [because (Ml) cannot 
be satisfied]. Observe that ^5 n "^^i = if and only ilQ 



sup 



^kfk 

k=l 



> for some n >0,Xi,e M+ and /t G 



(M-ASL) 



and we then say that the set ^ of almost-desirable gambles avoids sure loss. In that case, 
and only then, we are able to extend ^ to a coherent set of almost-desirable gambles, and 
the smallest such set is precisely co^, which is called the natural extension of to a set of 
almost-desirable gambles. 

What does natural extension mean for the equivalent model of coherent lower previ- 
sions? Suppose our subject specifies a supremum acceptable buying price, or lower previ- 



sion, P_{f) for each gamble / in some set of gambles C We can then interpret 

P as a real-valued map on ,J^, and we call £ a lower prevision on J(f , and say that J(f is 
the domain of P_. 

To study the problem of natural extension for this lower prevision, we shall use what 
we already know about natural extension in the context of almost-desirable gambles. Re- 
call that specifying P_ on ^ is tantamount to stating that the gambles in the set '2 \— 
{/ — -P(/) : / G are almost-desirable. We now look at the natural extension of this Si. 
Using ( IM-ASLb . we know that such a natural extension exists if and only i fl 



sup 



Y.^k\fk-pm 

k=\ 



> for all n > 0, h G M+ and /a- G -JT, 



(LPR-ASL) 



and we then say that the lower prevision P on J(f avoids sure loss. In this case, the natural 
extension S'l^ is the smallest coherent set of almost-desirable gambles that includes ^, and 
consequently the coherent lower prevision P^m associated with S'^ through 

P^.ig):^meix{s: g-se,f?'S} 

is the point-wise smallest coherent lower prevision on ^(^) that dominates £ on J(f. We 
call this coherent lower prevision the natural extension of P and we denote it by E^p. We 
deduce from ( IM-NEb that for all gambles g on 



Kp{g) = sup inf 

k=l...,n,n>0 



8-l,h[fk-P{fk) 

k=l 



^ - L ^kgk = sup inf 
k=i lk>OJke^ 

k=\...,n,n>Q 

(LPR-NE) 

If P incurs sure loss, i.e., (ILPR-ASLb is not satisfied, then (o^ = (,^) and consequently 
E^P assumes the value +°o in every gamble. 



space. 
17 



^^Actually, this condition is equivalent to the one where we always choose Aj- = 1 . 

^^This set of gambles JfT need not have any predefined structure; in particular, it does not have to be a linear 
Here too, this condition is equivalent to the one where we always choose A,t = 1. 
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We shall call the lower prevision P_ on J(f coherent, whenever it can be extended to 
a coherent lower prevision on ^(^), or in other words, whenever it coincides with its 
natural extension E_p on every gamble in its domain ,jf^. Taking into account ( ILPR-NEl i. 
we see that this happens exactly when 



sup 



Y.^k[fk-P{fk)]-k^[h-p{.h) 

k=l 



> for all n>0,XkeR+ and fk £ Jf, 

(LPR-COH) 

This coherence condition implies that P avoids sure loss. 

Let us see if, for lower previsions, we can give a more immediate behavioural interpre- 
tation for avoiding sure loss, coherence, and natural extension. This should allow us to 
develop more intuition, as the approach we have followed so far, which motivates these 
notions through the coherence axioms for real and almost-desirable gambles, is admit- 
tedly quite abstract. We begin with avoiding sure loss. Suppose that condition dLPR-ASLb 
is not satisfied. Then there are n > 0, Ai , . . . , A„ in M+ and /i , . . . /„ in ^ such that 
sup [£,1=1 ^k [fk ~ P.{fk)\\ < 0, which implies that there is some 5 > for which 

f A,[A-P(A) + 5]<-5. 

k=l 

Now, by the definition of P[fk), our subject accepts each of the gambles fi^ — P{fk) + S, 
so she should also accept the combined gamble L^^i ^k[.fk — Eifk) + ^] [use axioms (D3) 
and (D4) for real desirability]. But this gamble leads to a sure loss of at least 5. In other 
words, if condition ( ILPR-ASLl l doesn't hold, there are gambles which the subject accepts 
and which, if properly combined, make her subject to a sure loss. 

Next, assume that condition (ILPR-COHI l fails to hold. Then there are n > 0, Ao, . . . , A„ 
in K+ and /o, . . ./„ in JT such that supE^^j h[fk-P{fk)] - Ao[/o -£(/o)]] < 0. Assume 
that Ao > 0, as we have already considered the case Aq = in our discussion of avoiding 
sure loss. Then there is some 5 > such that 

E ^ [A - tifk) + S]<fo- (£(/o) + 5). 
k=i ^ 

As before, the gamble on the left-hand side is a gamble that our subject accepts. But then 
she should also accept the gamble /o — {P{fo) + 5) since it point- wise dominates a gamble 
she accepts [use (D2) and (D3)]. This implies that she should be willing to pay a price 
P(/o) + 8 for /o, which is strictly higher than the supremum price P{fo) she has specified 
for it. Coherence avoids this kind of inconsistency. 

Finally, we turn to natural extension. Consider a gamble g on then dLPR-NEt tells 
us that E_p{g) is the supremum s such that there are n > 0, Ai, . . . , A„ in M+ and /i, . . ./„ 
in J€ for which 

g-s>f^Xk[fk-P{.fk)] 

k=\ 

Now the expression on the right-hand side is almost-desirable, because it is a non-negative 
linear combination of almost-desirable gambles [apply the axioms (M3) and (M4)]. So 
g — s should be almost-desirable as well [apply the axioms (M2) and (M3)], and therefore 
our subject should be willing to buy g for any price t < s. So we deduce that E_p{g) 
is the supremum price for g that the subject can be forced to pay for the gamble g, by 
suitably combining transactions that she is committed to accept by her specifying the lower 
prevision P on . In other words, E_p{g) is the lower prevision for g that is implied by the 
assessments in P and coherence alone. 
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2.4. Coherent previsions: the Bayesian belief models. When a lower prevision P_onJ{f 
is self -conjugate, that is, when P_{f) = P{f) for any gamble / in ,J€, it is called a prevision. 
The common val ue P{f) is then called t he prevision of /; it is a/a/r price for the gamble 



/ in the sense of IdeFinettilil 974-1 975I1 . Formally, a real-valued function on a class of 



gambles J(f is called a linear, or coherent, prevision whenever 



sup 



Z[fk-n.fk)]-i,[gj-p{gj)] 

k=\ ;=I 



> for all «,m > and fk.gj £ JT, (PR-COH) 



A linear prevision is coherent, both as a lower and as an upper prevision. Moreover, if its 
domain is the class of all gambles, ^(^), then condition (IPR-COHI ) simplifies to 

(PRl) P{f + g) = P{f )+P{g) for any / and g in ^(S") [lineai-ity]. 
(PR2) P{f ) > inf/ for any / in ^{^) [accepting sure gains]. 

Linear previsions are the familiar Bayesian belief models: any linear prev ision on all gam- 



bles is indeed a coherent prevision in the sense of ide Finettil 111974-197511 : and a prevision 



defined on an arbitrary set of gambles is coherent exactly when it is the restriction of some 
coherent prevision on all gambles. The restriction to (indicators of) events of a coherent 
prevision on all gambles is a finitely additive probability. We shall denote by P( j?^) the set 
of all coherent previsions on ^{^). 

There is an interesting relationship between coherent previsions and coherent lower 
previsions. Let P be a lower prevision with domain J^, and let us denote by 

^{P) := {p e P( jr) : (V/ e j(r)p{f) > p{f)} 

the set of all coherent previsions on ^{^) that dominate P on its domain. Then it can be 
checke43 that P avoids sure loss if and only if ^{P) is non-empty, that is, if and only if 
there is some coherent prevision on ^{^) that dominates P on J(f, and P is coherent if 
and only if it is the lower envelope of ^{P), meaning that for all P in J(f, 

P{f)=mm{P{f):Pe^{P)}. 

Also, any lower envelope of a set of coherent previsions is a coherent lower prevision. 
Moreover, the natural extension E_p of P to all gambles can be calculated using the set 
^{P) of coherent previsions: for any gamble / on we have 

Ep{f)=mm{Pif):Pe^{P)}. 

This means that from a mathematical point of view, a coherent lower prevision P and its 
set of dominating coherent lower previsions ^(P), are equivalent belief models. It can 
be checked that this set is convex and closed in the weak* topologyl3 Moreover, there 
is a bijective relationship between weak*-closed convex sets of coherent previsions and 
coherent lower previsions (their lower envelopes). This fact can (but need not) be used to 
give coherent lower previsions a Bayesian sensitivity analysis interpretation, besides the 
direct behavioural interpretation given in Section |Z21 we might assume the existence of a 
precise but unknown coherent prevision P expressing a subject's behavioural dispositions, 
and we might model the information about P by means of a weak*-closed convex set of 
coherent previsions ./# (the set of possible candidates). Then, this set is mathematically 
equivalent to its lower envelope P, which is a coherent lower prevision. We shall come 
back to the difference between the direct behavioural and the Bayesian sensitivity analysis 



'^See IWalle\ilT99ll Sections 3.3-3.4] for proofs for these statements. 

'^The weak* topology on the s et of all contin uous linear functionals on ^(.iST) is the topology of point-wise 
convergence. For more details, see IWalle-vlll99lL Appendix D]. 
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interpretation of a lower prevision in Section 1431 when we discuss the interplay between 
these interpretations and the notion of symmetry. 

Taking into account the bijective relationship that exists between coherent lower previ- 
sions and sets of almost-desirable gambles, we may also establish a bijective relationship 
between sets of coherent previsions and sets of aknost-desirable gambles: given a weak*- 
closed convex set ^ of coherent previsions on ££{3^), the class 

'Sm ■■= {/ e ^{^) : (VP G Ji)P{f) > 0} 

is a coherent set of almost-desirable gambles, that is, it satisfies the coherence condi- 
tions (M1)-(M5). Conversely, given a coherent set of almost-desirable gambles ^, the 
corresponding set of coherent previsions 

^(^) {P e P(^) : (V/ e ^)P{f) > 0} 

is a weak*-closed convex set of coherent previsions. 

Hence, there are at least three mathematically equivalent representations for the be- 
havioural dispositions of our subject: coherent sets of almost-desirable gambles, coherent 
lower previsions, and weak*-closed convex sets of coherent previsions. The bijective rela- 
tionships between them are summarised in Table [T] 

/ & P(-) ^ 

^ {/: P{f) > 0} {/: (VP e ^)P(/) > 0} 

£(■) meLx{s: -seS} min{P(-) : P e ^} 

^ {P: (V/ e ^)P{f ) > 0} {P: (V/)P(/) > P(/)} 

Table 1 . Bijective relationships between the equivalent belief models: 
coherent sets of almost-desirable gambles i^, coherent lower previsions 
P on ^{^), and weak*-closed convex sets ^ of coherent previsions 
on^(^) 



We now briefly discuss a number of belief models that constitute particular instances of 
coherent lower previsions. First, we consider n-monotone lower previsions, where « > 1. 
A lower prevision P is called n-monoton^ when the following inequality holds for all 
p(zN,p<n, and all /, /i, . . . , fp in ^( JT): 

L {~imf^Af)>o, 

/c{i p} \ iei ) 

where, here and further on, |/| denotes the number of elements in a finite set /. A similar 
definition can be given if the domain of P is only a lattice of gambles, i.e., a set of gambles 
closed under point-wise minimum A and point-wise maximum V. Suc h «-monotone lower 



previsions are particular instances of exact functionals llMaaBll2003ll . i.e., they are scalar 
multiples of some coherent lower prevision. In particular, an «-monotone lower probability 
defined on a lattice of events that contains and X is coherent if and only if P(0) = 
andP(j:') = 1. 



^°See lDe Cooman et al.l l2006ll2005bBl for a detailed discussion of n- and complete monotonicity for lower 
previsions. 
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A completely monotone lower prevision is simply one that is n-monotone for any natural 
number n > 1 . When it is defined on indicators of events, it is called a completely monotone 
lower probability. When ^ is finite, this leads to belief functions in the terminology of 
[Shafer [1976 1. 

Two particular cases of belief functions and their conjugate upper proba bilities are prob- 

abilit y charges, or finitely additive probabilities defined on a field of events I Bhaskara Rao and Bhaskara RaoL 
1 198311 and possibility measures. The latter I De Cooman. 2001, Zadeh, 1978J are set func- 



tions n satisfying n (Uie/^O = sup,g/n(A,) for any family {A;),g/ of subsets of ^ . n is 
a coherent Mpper probability if and only if n(^) = 1. 

Finally, we can consider a particular instance of a completely monotone coherent lower 
prevision that allows us to model complete ignorance, the so-called vacuous lower previ- 
sion. It is given by 

P,Af) - inf. /(^), 

for all gambles / on J^' . It corresponds to the set of almost-desirable gambles & — — 
{/ '■ f > 0}, and to the set ^ = P(^) of all coherent previsions on J^. If we have no 
information at all about the values that X takes in we have no reason to reject any 
coherent prevision P, and this leads to the vacuous lower prevision as a belief model. 
More generally, we can consider a vacuous lower prevision relative to some subset A of 
which is given by 

= inf/(x). 

A vacuous lower prevision relative to a set A is the adequate belief model when we know 
that the random variable X assumes values in A, and nothing else. The restriction to events 
of a vacuous upper prevision is a (zero-one-valued) possibility measure. 

2.5. Incomparability and indifference. We claimed in the Introduction that Bayesian 
belief models do not take indecision seriously, and that we therefore need to look at a larger 
class of belief models that do not have this defect. Here, we present a better motivation for 
this claim. 

Consider two gambles / and g on X . We say that a subject almost-prefers f to g, and 
denote this as / ^ whenever she accepts to exchange g for / in return for any (strictly) 
positive amount of utility. Given this definition, it is straightforward to check that we can 
express this in terms of the three equivalent belief models '3, P_ and ^ of the previous 
sections by 

f>g<^f-gc.& 

^P{f-8)>0 

^{ype^)P{f)>P{g). 

The binary relation ^ is a partial pre-order on ^(^), i.e., it is reflexive and transitive^] 
Observe also that f h g f — g and that / >^ O / G ^, so / is almost-preferred to 
g if and only if / — ^ is almost-preferred to the zero gamble, which in turn is equivalent to 
the fact that our subject almost-accepts f — g, i.e., that / — ^ is almost-desirable to her. 
Unless our subject's lower prevision P is actually a (precise) prevision P (meaning that 
is the semi-space {/: P{f) > 0}, and that ./# = {P}), this ordering is not linear, or total: 
it does not hold for all gambles / and g that f h g ot g f. When, therefore, both f g 



^^The binary relation is actually a vector ordering on the linear space J^{S'), because it is compatible 
with the addition of gambles, and the scalar multiplication of gambles with non-negative real numbers. 
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and g ^ f, we say that both gambles are incomparable, or that the subject is undecided 
about choosing between / and g, and we write this as / || g. 

It is instructive to see why the relation || is non-empty unless P is a precise prevision 
P. If P is not precise (but coherent), there is some gamble h such P{h) < P{h). Let x 
be any real number such that P{h) <x < P{h). In this case, the subject does not express 
a willingness to buy h for the price x, because x is strictly greater than her supremum 
acceptable price P{h) for buying h. Nor does she express a wilUngness to sell h for a price 
X, because x is strictly smaller than her infimum acceptable price P{h) for selling h. But 
there is more. Consider the gambles f := h—x (buying h for a price x) and g := x — h 
(selling h for a price x). Then it follows from the coherence of P that 

P{f -g)= 2p(h -x)= 2\P{h) - x] < and P{g - /) = 2P{x -h)=2[x- P{h)] < 0, 

so / II ^: our subject is also undecided in the choice between buying h for x or selling h for 
that price. 

We say that our subject is indifferent between / and g, and denote this as / « ^ when- 
ever both f h g and g h f- This means that P(/ — g) = P,{g — /) =0, or equivalently, 
P{f) = P{g) for all P in Clearly, « is an equivalence relation (a reflexive, sym- 
metrical and transitive binary relation) on jSf It is important to distinguish between 
incomparability and indifference. Indifference between gambles / and g represents strong 
behavioural dispositions: it means that our subject almost-accepts to exchange / for g and 
vice versa; on the other hand, incomparability has no behavioural impUcations, it merely 
records the absence of a(n expressed) behavioural disposition to choose between / and g. 



3. Monoids of transformations 

Symmetry is generally characterised mathematically as invariance under certain trans- 
formations. In this section, we provide the necessary mathematical apparatus that will 
allow us to describe and characterise symmetry for the belief models we are interested in. 

3.1. Transformations and lifting. We are interested in models for beliefs that concern 

a random variable X. So let us begin by concentrating on transformations of the set of 
possible values ^ for X. A transformation of ^ is defined mathematically as a map 
r : ^ — > ^ : X Tx. At this point, we do not require that such a map T should be onto 
(or surjective), i.e., that T{^) := {Tx: x G ^} should be equal to .ST. Neither do we 
require that T should be one-to-one (or injective), meaning that Tx = Ty implies x = y for 
all X and yin A transformation of ^ that is both onto and one-to-one will be called a 
permutation of but we shall in the sequel also need to consider transformations of 
that are not permutations. 

Suppose we have two transformations, T and 5, of ^ that are of interest. Then there 
is no real reason why we shouldn't also consider the combined action of T and S on 
leading to new transformations ST :=5or and TS:=ToS, defined by {ST)x:=S{Tx) and 
similarly TSx := T{Sx) for all x in And of course, we could also consider in a similar 
way TST and STS, or for that matter TTTSST, which we shall also write as T^S^T. So it 
is natural in this context to consider a set ^ of transformations of that is closed under 
composition, i.e.. 



(SG) 
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Such a set is called a semigroup of transformations^ If moreover the semigroup 5^ con- 
tains the identity map idf , defined by id^^x := x for all x in it is called a monoid. As 
the identity map leaves all elements of ^ unchanged, it has no implications as far as sym- 
metry and invariance are concerned, and we can therefore in what follows assume without 
loss of generality that any .^'^ we consider actually contains id,£ (is a monoid). 

A monoid is Abelian if ST — TS for all T and S in An important example of 
an Abelian monoid is the following. Consider a single transformation T of and the 
Abelian monoid generated by T, given by 

^T-={T": n>Q}, 

where := idj- is the identity map on j?^', r' := T and for n > 2, 

T" := roro---or . 

n times 

A monoid 5^ of transformations is called left- (respectively right-)cancellable when for 
every transformation T in there is some 5 in ^ such that 57" = id (respectively TS = 
id^). This transformation S is then called a te/f- (respectively right-)inverse of T. If 
=^ is both left- and right-cancellable, then the left-and right-inverses of T are unique and 
coincide for any T in ,"7, and ,"7 is called a group. Any element of ^ is then a permutation 
of J". 

For our purposes here, we generally only need to assume that 5^ is a monoid, because 
there interesting (and relevant) situations where ,"7 is not a group; this is for instance the 
case for the Abelian monoid of the shift transformations of the set of natural numbers N: 

^e:-{0": n>0}, (1) 

where d{m) = m + 1, and 9"{m) = m + n for all natural numbers m and n. Another im- 
portant example is the monoid of all transformations of which is generally not 
Abelian, nor a group. 

Since we are also concerned with gambles / on we need a way to turn a transfor- 
mation of ^ into a transformation of ^{^). This is done by the procedure of lifting: 
given any gamble / on J^, we shall denote by T'f the gamble foT, i.e., 

T'f{x):^f{Tx), 

for all ^ in For an event A, T'Ia = Ij-^iAy where T^' (A) := {x G : Tx e A} is the 
so-called inverse image of A under T . On the other hand, given a constant jx, we have 
T' 11 = pL for any transformation T . 

The following observation is quite important. Consider two transformations T and S on 
Then for any gamble / on ^ we see that 

{ST)'f = fo{SoT) = {foS)oT = (S'f) oT = T' (S'f) , 

so {STy = T'S', and lifting reverses the order of application of the transformations: for x 
in ,!^, STx means that T is applied first to x, and then S to Tx. For / in ^(^), {STff 
means that S' is applied first to / and then T' to S'f. 

Any transformation T of ^ can therefore be lifted to a transformation T' of ^{J^), 
and we denote the corresponding set of liftings by 5^'. ,'7' is then a monoid of transfor- 
mations of ^(^). Lifting preserves the most common properties of semigroups, taking 
into account the above-mentioned order-inversion: being a monoid, being Abelian, and 
being a group are preserved under lifting. But being left-cancellable is turned into being 



A semigroup is defined as a set with a binary operation tliat is internal and associative. Composition of 
maps is always an associative binary operation, and jSGt guarantees that it is internal in S. 
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right-cancellable, and vice versa. Lifting also has the interesting property that it turns a 
transformation J on ^ into a linear transformation T' of the linear space ^(^): for any 
pair of gambles / and gon and any real numbers X and /i, we have 

T\lf + iig) = XT' f + 11^8. 

li.l. Invariant (sets of) gambles. We now turn to the important notions of invariance 
under transformations. We start with the invariance of a set of gambles, because that is the 
most general notion, from which all other notions of invariance can be derived. If J(f is a 
set of gambles on ^ , and T any transformation of 3^, then we denote by 

T'Jf:^ {T'f: feJt} 

the direct image of the set under T', and we say that ^ is S/' -invariant if 

i.e., if all transformations in 5^' are internal in 

A gamble / on ^ is called ."^ -invariant if the singleton {/} is, i.e., if T' f = f for all 
transformations T in the monoid .^J'. We call an event A ^-invariant if its indicator Ia is, 
i.e., if (A) ^ A for all T in J^. 

Let us denote by J^^r the set of all ^-invariant events. It is easy to check that J^,^ 
is an ample field, i.e., it contains and and it is closed under arbitrary unions and 
complementation, and therefore also under arbitrary intersections. For any x in we 
shall call 

[x] ^ := Pi {A : A G J^s and xeA} 
the ,'7 -invariant atom containing x. It is the smallest 5^-invariant event that contains x. 
Any ^-invariant event A is a union of =$^-invariant atoms: A ~ Uxga [Asr- We shall denote 
by the set of all invariant atoms: s^,^ := {[x]^ : x g ^}. It is a partition of A 
gamble / on ,9^ is ^-invariant if and only if it is constant on the ,^-invariant atoms of 
Of course, the bigger the set of transformations S/', the smaller the number of 
invariant events (or, equivalently, the bigger the atoms [x]^). The following proposition 
relates the ^-invariant atoms [x] ^ to the images of x under the transformations in S/' . 

Proposition 1. Let ^ be a monoid of transformations of , and let x be any element of 
In general we have that {Tx: T e C [x]^. If £^ is left-cancellable, then [x]^ = 
{Tx:T(^£r]. 

Proof. Fix X in Let ^/'{x) :— {Tx: T £ for brevity of notation. Consider any T in 

Since [x]^ is T-invariant, we have that T '([x]^) — [x]^. Since x G [x]^ because 
is a monoid, we infer from this equality that Tx G [x] ^. Hence indeed ."^{x) C [x]^. 

To prove the converse inequality, assume that 5^ is left-cancellable. Consider any S 
in 3". If we can prove that 5^(x) is S'-invariant, meaning that ^^^(^^(x)) = 5^(x), then 
the proof is complete, since then ,'7{x) will be 5^-invariant, and since this set contains 
X [because id^- G ^], it must include the smallest ^f^'-invariant set [x] 5? that contains x. 
So we set out to prove that ^^'(^^(x)) = S^{x). Consider any y in ^ . First assume that 
y G c$^(x). Then there is some T m ^ such that y — Tx, whence Sy = STx G ^(x), since 
ST G 5^. Conversely, assume that y G S^^ {^{x)), or equivalently, that Sy G 5^(x), then 
there is some T in ^ such that Sy = Tx, and since is assumed to be left-cancellable, 
there is some S' in £^ such that S'S = idjf , whence ^{x) 3 S'Tx = S'Sy — y, since S'T G 
^. □ 



'So S' is a monoid of transformations of Jif. 
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An important special case is the following. Consider a transformation T of ^ , and 
the Abelian monoid = {T" : n > 0} generated by T. Then a set of gambles is J^- 
invariant if and only if T'J(f C J{f, and we simply say that ^ is T -invariant. Similarly, a 
gamble / is 5^ -invariant if and only if T'f — f, and we say that / is T -invariant. In what 
follows, we shall always use the phrase T-invariant' for '.5^ -invariant'. Also J^t is the 
set of T-invariant events, and it is an ample field whose atoms are denoted by [x]j. With 
this notation, we have for an arbitrary monoid =^ that J^^r = Clre,^ ^t- 

For instance, the particular case of the shift transformations of N given by Eq. ([T]l con- 
cerns the Abelian monoid generated by 9. Here, the only 0- (or shift-)invariant events 
are and N, and consequently a gamble / on N is 0-invariant if and only if it is con- 
stant. This also shows that the equality in the first part of Proposition [T] need not hold 
when the monoid of transformations ^ is not left-cancellable: in the present case, we 
have that 5^(hj) = {0"(m) : n > 0} = {n G N: n > m} is strictly included in the invariant 
atom [m] g = N for all m > 1 . 

Another interesting case is that of , the class of all transformations of This a 
monoid, but it is not generally a group, nor Abelian. Moreover, it is not generally left- 
cancellable. We have, for any element xof ^ that {Tx: T G } = and from Propo- 
sition[T]we deduce in a trivial manner that [x] — S^: the only invariant events under all 
transformations of are and ^ . This shows that the left-cancellability condition in the 
second part of Proposition[T]is not generally necessary. 

4. Symmetry and invariance for belief models 

We now have the necessary mathematical tools for studying the issue of symmetry in 
relation to the belief models discussed in Section |2] We shall see that for these coherent 
sets of almost-desirable gambles, there is an important distinction between the concepts 
'symmetry of models' (which we shall call weak invariance) and 'models of symmetry' 
(which we shall call strong invariance). Let us first turn to the discussion of symmetrical 
belief models. 

4. 1 . Weak invariance: symmetry of models. Consider a monoid of transformations 
of We want to express that a belief model about the value that the random variable 
X assumes in 1^ , exhibits a symmetry that is characterised by the transformations in 
Thus, the notion of (weak) invariance of belief models that we are about to introduce is in 
a sense a purely mathematical one: it expresses that these belief models are left invariant 
under the transformations in ■S/' . 

Definition 1 (Weak invariance). A coherent set of almost-desirable gambles '3 is called 
weakly -invariant if it is ^-invariant as a set of gambles, i.e., if T'^ C ^ for all T in 

Why don't we require equality rather than the weaker requirement of set inclusion in this 
definition? In linear algebra, invariance of a subset of a linear space with respect to a linear 
transformation of that space is generally defined using only the inclusion. If we recall from 
Section [3] that lifting turns any transformation T of J^T into a linear transformation T' of 
the linear space ^(^), we see that our definition of invariance is just a special case of a 
notion that is quite common in the mathematical literature. 

A few additional comments are in order. First of all, any coherent set of almost-desirable 
gambles is weakly id -invariant, so we may indeed always assume without loss of gener- 
ality that ^ is at least a monoid (contains id^ ). 
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Secondly, we have given an invariance definition for almost-desirability, but the def- 
inition for coherent sets of really desirable gambles !M is completely analogous: for all 
r in !J , T* M C SI. Observe that if S^, is weakly ,$^-invariant then the associated set of 
almost-desirable gambles ^i^, given by (ID-Ml i. is weakly 5^-invariant as well. 

Thirdly, if 5^ is a group (or at least left-cancellable), then the weak invariance condition 
is actually equivalent to T* '3 — Si for all T in 3": given a transformation T in ^ and its 
(left-)inverse 5 £ ,5^, consider / e i^; then T\S' f) — (ST)'/ ~ /, so there is a gamble 
g = S' f, which belongs to S by weak invariance, such that / = T'g; this means that 
/ G T'3, so ^ C T'3 as well. 

In summary, weak invariance is a mathematical notion that states that a subject's be- 
havioural dispositions, as represented by a belief model ^, are invariant under certain 
transformations. If we posit that a subject's dispositions are in some way a reflection of 
the evidence available to her, we see that weak invariance is a way to model 'symmetry 
of evidence'. The following examples try to argue that if there is 'symmetry of evidence', 
then corresponding belief models should at least be weakly invariant. 

The example of shift transformations. Suppose our subject is completely ignorant about the 
value of a random variable X that assumes only non-negative integer values, so ^ = N. If 
her belief model is to be a reflection of the available evidence (none), we should like it to 
be weakly invariant with respect to the shift transformations in 5e ( which is an Abelian 
monoid, but not a group). Indeed, if she is ignorant about X, she is also ignorant about 
Q{X) = X + \, apart from the fact that she knows that 0{X) cannot assume the value 0, 
whereas X can. Therefore, if our subject almost-accepts a gamble /, she should almost- 
accept 0'/: 9'f{X) — f{X + 1) may assume the same values as f{X), apart from the value 
/(O), and because of her ignorance, our subject has no reason to treat the shifted gamble 
differently. ♦ 

The dice example. Let us go back to the die example. Suppose that whatever evidence our 
subject has about the outcome X of rolling the die, is left invariant by permutations n of 
^ = {1, ■ ■ ■ ,6}. Assume that our subject almost-accepts a gamble /, meaning that she is 
willing to accept the uncertain reward f{X) + e for any £ > 0. But since the evidence gives 
our subject no reason to distinguish between the random variables X and k{X), she should 
also be willing to accept the uncertain reward f{n{X)) + e for any e > 0, or in other words, 
she should almost-accept the gamble n' f. 

We now investigate the corresponding notions for weak invariance for the equivalent 
belief models: coherent lower previsions and weak*-closed convex sets of coherent previ- 
sions. In order to do this, it is convenient to define the transformation of a (lower) prevision 
under a transformation T on ^ , by lifting T to yet a higher level. 

Definition 2 (Transformation of a functional). Let T be a transformation of ^ and let 
A be a real-valued functional defined on a T-invariant set of gambles C ^(^). Then 
the transformation TA of A is the real-valued functional defined on J(^ by TA := AoT', 
or equivalently, by TA{f) := A{T'f) = A{foT) for all gambles / in 

Tlieorem 2. Let Pbe a coherent lower prevision on ^{^), S> a coherent set of almost- 
desirable gambles, and ^ a weak*-closed convex set of coherent previsions on .if (^). 
Assume that these belief models are equivalent, in the sense that they correspond to one 



We use the same notation T for the transformation of X' and for the corresponding transformation of 
a functional, first of all because we do not want to overload the mathematical notation, and also because, in 
contrast with lifting only once, lifting twice preserves the order of application of transformations. 
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another using the bijective relations in Table\l\ Then the following statements are equiva- 
lent. 

1. & is weakly 3^ -invariant, in the sense that T' S! C '3 for all T in . 

2. P is weakly ^-invariant, in the sense that TP > P for all T in S', or equivalently 
Bj'f) > £(/) for all T in ^ and f in ^{3C); 

3. ^ is weakly cfT-invariant, in the sense that T C ^ for all T in !^ , or equivalently, 
TP G for all P in J/: and all T in ^0 

Proof. We give a circular proof. Assume that is weakly 5^-invariant. Consider any T in 
3" and / in ^ , and observe that for the corresponding lower prevision P_ 

P{T'f)=ma\{ii: T'f-jX e 3} >max{^: f - e ^} =£(/), 

where the inequality follows from the invariance assumption on 3. This shows that the 
first statement implies the second. 

Next, assume that P is weakly <5^-invariant, and consider any T in i7 and P in the 
corresponding ^ = ^{P) — {P: (V/)P(/) > P{f )}. Then for any gamble / on we 
have that TP{f) = P{T'f) > P(T'f) > P{f), where the second inequahty follows for the 
invariance assumption on P_. This tells us that indeed TP G ^{P), so the second statement 
implies the third. 

Finally, assume that ^ is weakly 5^-invariant. Consider any T in ,'7 and any gamble 
/ in the corresponding 2! = — {/: (VP G ^)P{f ) > 0}. Then we have for any P in 
^ that P{T' f) = TP{f ) > 0, since TP belongs to ^{P) by the invariance assumption on 
^ . Consequently T'/ G ^, which proves that the third statement implies the first. □ 

A coherent prevision P on ^(^) is weakly ^-invariant if and only if TP — P for 
all T in 3^. This is easiest to prove by observing that .M(P) — {P}0 So for coherent 
previsions, we have an equality in the weak invariance condition. As we argued before, we 
generally won't have such an equality for arbitrary monoids S/' , but the following corollary 
gives another sufficient condition on S' . 

Corollary 3. If the monoid is left-cancellable, then the first weak invariance condition 
in Theorem\2\becomes T' S! — 2! for all T in 3^. If -S^ is right-cancellable, then the second 
and third weak invariance conditions become TP = P and T = for all T in 

Proof. We have already proven the first statement near the beginning of Section 14.11 To 
prove the second statement, it suffices to show that when ^ is right-cancellable, 
invariance implies that P > TP and ./# C T^ for all T in Consider any transformation 
T in the monoid and let Rhea right-inverse for T, i.e., TR = id . Consider a gamble h 
on J", then P{h) = PiiJRjh) = P{R'{T'h)) > P{T'h), where the inequality follows from 
the weak invariance of P. So indeed, P > TP. Similarly, consider P in ^ . Then RP G 
by weak invariance, and for any gamble / on S", T[RP){f) = RP{T'f) = P{R'{T'f)) = 

^^This shows that our notion of a weakly invariant belief model corresponds to lPericchi and Wallevl 's Il99lll 
notion of a 'reasonable (or invariant) class of priors', rather than a 'class of reasonable (or invariant) p riors', 
the latter being what our notion of strong invariance will correspond to. On the other hand, WalleiJ ll99lL 
Definition 3.5.1] defines a .^^'-invariant lower prevision P as one for which P{T'f) = P{f) for all T E ,T and all 
gambles /, so he requires equality rather than inequality, as we do here. 

^^See Proposition Qfor a more direct proof. 

The reason for the difference in terms of left- versus right-cancellability lies of course in the fact that in the 
first condition, we work with transformations T' of gambles, and in the second and third condition we work with 
transformations T of functionals, which are liftings of the former; simply recall that lifting reverses the order of 
application of transformations. 
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P{f) since R' (T'f) = {TR)'f^ f. So there is a Q = in ^ such that P = TQ, meaning 
that P e TJ^. So indeed ^ C TJ^. □ 

We see from the definition that if a coherent set of almost-desirable gambles ^ (or a 
coherent lower prevision, or a weak*-closed convex set of coherent previsions) is weakly 
,5^-invariant, it is also weakly 5^'-invariant for any sub-monoid of transformations 3/^' C 
^. Hence, as we add transformations, the collection of weakly invariant belief models 
will not increase. The limit case is when we consider the class ,5^^ of all transformations 
on . The following theorem shows that the vacuous belief models are the only ones that 
are completely weakly invariant, i.e., weakly ^^^i -invariant. 

Theorem 4. Let i^x be the monoid of all transformations of 2^ . Then the vacuous coher- 
ent set of almost-desirable gambles ( or equivalently, the vacuous lower prevision P^, 
or equivalently, the weak*-closed convex set of all coherent previsions P(^)j is the only 
coherent set of almost-desirable gambles (coherent lower prevision, weak*-closed convex 
set of coherent previsions) that is weakly -invariant. 

Proof. We give the proof for coherent sets of almost-desirable gambles. It is obvious 
that is Jjj- -invariant. So, consider any -invariant coherent set of almost-desirable 
gambles ^. It follows from coherence [axiom (M2)] that '■^^ C Assume ex absurdo 
that "^-f C & and let / be any gamble in \ "^-f. This means that there is some xq in 
^ such that /(xq) < 0. Consider the transformation Tx^ of that maps all elements of 
^ to Xq, then r/g/ — f{x()) and it follows from the 7]iQ-invariance of ^ that the constant 
gamble /(xq) G which violates coherence axiom (MI), so ^ cannot be coherent, a 
contradiction 13 □ 

This result also tells us in particular that the vacuous belief model is always i^-invariant 
for any monoid of transformations 5^. This implies that for any monoid of transformations 
there always are 5^-invariant belief models. 

What are the behavioural consequences of weak invariance with respect to a monoid of 
transformations ^? It seems easiest to study this in terms of coherent lower previsions. 
First of all, we have that for any gamble / on ^' and any T in our subject's supremum 
buying price P{T'f) for the transformed gamble T'f should not be strictly smaller that her 
supremum price P{f ) for buying / itself. 

But there is also a more interesting consequence. Indeed, it follows from the coherence 
of P that 

P{f~T'f)<P{f)-P{T'f)<0. 



WallevI 1119911 Section 3.8.1] suggests that a subject strictly prefers a gamble / to a gamble 
g, which we denote as / :^ ^, if / > ^, or also if she accepts to pay some (strictly) positive 
price for exchanging g with /, so if P{f ~ g) > 0. This means that weak =5^-invariance 
implies that 

/ ^ T'f for all / in JT) and all T in ^ such that / ^ T'f 

which models that our subject has no reason (or disposition) to strictly prefer any gamble 
f to any of its transformations T' f that it doesn 't strictly dominate. 



A similar argument tells us that the same result holds for complete weak invariance of coherent sets of 
really desirable gambles, where now the axiom (Dl) will be violated. 
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4.2. Strong invariance: models of symmetry. Next, suppose that our subject believes 
that the (phenomenon underlying the) random variable X is subject to symmetry with re- 
spect to the transformations T in ,3^, so that she has reason not to distinguish between 
a gamble / and its transformation T' f. Let us give an example to get a more intuitive 
understanding of what this means. 

The dice example (cont.). Again, let us go back to the die example. Consider the gambles 
/{^}, forx S ^6 ■ ■ ■ ,(^}- Since our subject believes the die (and the rolling mechanism 

behind it) to be symmetrical, she will be willing to exchange any gamble /|,-} for any other 
gamble I^y-j in return for any strictly positive amount of utility: /{-^j — should therefore 
be almost-desirable to her, or in other words, in terms of her lower prevision P: 

P{I{x} ^ I{y}) > for all X and y in J?^. 

This is equivalent to stating that /{^j — should be almost-desirable, or that P{I{x} ~ 

^'^{.r}) > for all X e ^ and all permutations n of Now the only coherent lower 
prevision that satisfies these requirements is the uniform (precise) prevision, which assigns 
precise probability g to each event {x} [simply observe that for any coherent prevision P 
in ./#(-?) it follows from these requirements that -P(/{j}) = ♦ 

Let us now try and formalise the intuitive requirements in this example into a more 
formal definition. We stated above that if our subject believes that the (phenomenon un- 
derlying the) random variable X is subject to symmetry with respect to the transformations 
T in then she has reason not to distinguish between a gamble / and its transforma- 
tion T'f. Suppose she has the gamble / in her possession, then she should be willing to 
exchange this for the gamble T'f in return for any strictly positive price, and vice versa. 
This means that she should almost-accept both / — T'f and T'f — /, or in the language of 
Section |275l that she is indifferent between f and T'f: f w T'f. If ^ is her coherent set of 
almost-desirable gambles, this means that 

f-T'fe& and T'f-fe^ for all / in (^) and all T in ^. 

If we define 

^.T {f-T'f: f e ^{^),T G ^} = {T'f-f: f e ^{^),T e ^} , 
this leads to the following definition. 

Definition 3. A coherent set of almost-desirable gambles is called strongly ,^7 -invariant 
iff -T'f for all / in .if (^) and all T in .J, or equivalendy, if C ^. 

The following theorem gives equivalent characterisations of strong invariance in terms of 
the alternative types of belief models. 

Theorem 5. Let Pbe a coherent lower prevision on '2i a coherent set of almost- 

desirable gambles, and ^ a weak*-closed convex set of coherent previsions on ^( j?^). 
Assume that these belief models are equivalent, in the sense that they correspond to one 
another using the bijective relations in Table\l\ Then the following statements are equiva- 
lent: 

1. & is strongly -S^ -invariant, in the sense that S>_^ C 

2. P is strongly ^iJ'-invariant, in the sense that P{f — T' f) > and P{T'f — /) > 0, and 
therefore P{f - T'f) = P{T' f - f) = Ofor all f in if ( JT) and T in ST; 

3. .M is strongly 5^-invariant, in the sense that TP = P for all P in ^ and all T in 



^^So strongly invaiiant belief models conespond to the lPericchi and Wallevl 's Il99lll notion of a 'class of 
reasonable (or invariant) priors'. 



SYMMETRY OF MODELS VERSUS MODELS OF SYMMETRY 



23 



Proof. We give a circular proof. Assume that ^ is strongly 5^-invariant, and consider any 
gamble / on ^ and any T in ,"7 . Then we find for the associated coherent lower prevision 
P that P(/ - r7) max f-T'f-sG ^} > 0, and similarly that £(7'/-/) > 0. But 
since P is coherent, we find that also P{f - T'f) = -P{T' f - /) < -P{T' f - /) < and 
similarly P{T' f - /) = -?(/ - T'f) < -P{f - T'f) < 0, whence indeed P{f - T'f) = 
P(T'f — /) = 0, so the first statement implies the second. 

Next, assume that P is strongly 5^-invariant and consider any P in the associated set 
of dominating coherent previsions = {P: (V/)(f (/) > P{f))} and any T in Then 
for any gamble / on ^ we see that P{f — T' f) > and P{T'f — f) > 0, and since P is a 
coherent prevision, this implies that P{T'f) = P{f), so indeed TP ~ P. Hence, the second 
statement implies the third. 

Finally, assume that is strongly ^-invariant, and consider any gamble / on ^ 
and any T in 5^. Then for all P in ^ we have that P{f - T'f) = P{T'f - /) = 0, so 
both / — T'f and T'f — / belong to the associated set of almost-desirable gambles — 
{g: (VP e ^)P(g) > 0}. This tells us that the third statement imphes the first. □ 

Let us now study in more detail the relationship between weak and strong invariance. 
First of all, strong invariance implies weak invariance, but generally not the other way 
around. It is easiest to see this using weak*-closed convex sets of coherent previsions 
If ^ is strongly ^-invariant, we have that TP = P and consequently TP £ ^ for all 
P in ^ , so ^ is also weakly ^-invariant. To see that the converse doesn't generally 
hold, consider the set of all coherent previsions P(^) (the vacuous belief model), which 
is weakly invariant with respect to any monoid of transformations, but not necessarily 
strongly so, as, unless ^ contains only one element, we can easily find transformations T 
and coherent previsions P such that TP is different from P (also see Theorem|6]below). 

But the theorem above, when interpreted well, also tells us a number of very interesting 
things on this issue. First of all, we see that a coherent prevision P on ^(^S^) is strongly 
^-invariant if and only if it is weakly ^-invariant, so both notions of invariance coincide 
for coherent previsions. So anyone who insists on modelling beliefs with Bayesian belief 
models (coherent previsions) only, cannot distinguish between the two types of invariance. 
This confirms in general what we claimed in the Introduction about Bayesian belief models. 
From now on, we shall therefore no longer distinguish between strong and weak invariance 
for coherent previsions, and simply call them invariant. 

Furthermore, we see that a coherent lower prevision P is strongly 5^-invariant if and 
only if all its dominating coherent lower previsions are, or equivalently, if all its dominating 
coherent previsions, i.e., all the coherent previsions in ./# (P), are 5^-invariant. Or even 
stronger, it is easy to see that a coherent lower prevision is strongly invariant if and only if 
it is a lower envelope of some (not necessarily weak*-closed nor convex) set of invariant 
coherent previsions. 

The notions of weak and strong invariance, and the motivation for introducing them, 
are tailored to the direct behavioural interpretation of lower previsions, or the equivalent 
belief models. But what happens if we give a lower prevision P a Bayesian sensitivity 
analysis interpretation? We then hold that there is some actual precise coherent prevision 
Pa modelling the subject's uncertainty about the random variable X, that we have only im- 
perfect information about in the sense that we only know that Pa > P, or equivalently, that 
Pa e ^(P). Assume that we want the imperfect model P to capture that there is 'sym- 
metry of evidence' with respect to a monoid of transformations ,!7. The actual model P, 
then should be weakly ,:^-invariant, but since this is a (precise) coherent prevision, we can 
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not distinguish between weak and strong invariance, and it should therefore simply be S'- 
invariant: TPa = Pa for all T ^ 17. Since ^{P) is interpreted as the set of candidate models 
for Pa, all of the coherent previsions P in ^(P) must be 5^-invariant too, or equivalently 
P_ must be strongly ^-invariant. A completely analogous course of reasoning shows that 
if we want P to capture 'evidence of symmetry', P must be strongly 5^-invariant as well. 
So in contradistinction with the direct behavioural interpretation, on a Bayesian sensitivity 
analysis interpretation of P, we cannot distinguish between 'symmetry of evidence ' and 
'evidence of symmetry', and strong invariance is the proper symmetry property to use in 
both casesr^ 



As is the case for weak invariance, a belief model that is strongly .^'-invariant, is also 
strongly 5^'-invariant for any sub-monoid ^' C But in contrast with weak invariance, 
given any monoid ,'7, there do not always exist coherent belief models that are strongly 
invariant with respect to S/' . This is an immediate consequence of the following theorem, 
which makes an even stronger claim: it is totally irrational to require complete strong 
invariance, i.e., strong invariance with respect to the monoid of all transformations of 



Theorem 6. Assume that contains more than one element. Then any belief model that 
is strongly S^^; -invariant incurs a sure loss. 

Proof. We shall give a proof for lower previsions. Assume ex absurdo that P_ avoids sure 
loss, so ^{P) is non-empty. Consider any P in ^{P) and any non-constant gamble / on 
^ [there is at least one such gamble because contains more than one element] . This 
implies that there are (different) x\ and X2 in 3^ such that f{x\) ^ f{x2). For any y in 
3^ , consider the transformation Ty that maps all elements of ^ to y. Then we find that 



nf - f{y), whence P{f{y) ~ f) > P{f{y) - /) > and P{f - f{y)) > P{f - f{y)) > 0, 



since P is by assumption in particular strongly Tj.-invariant. Consequently P{f) = f{y). 
But this holds in particular for y—jci andfory = X2, so we infer that /(xi) =P{f) =f{x2). 



In fact, we easily see in this proof that given the transformation Ty that maps all elements 
of j?r to y, the only strongly Ty-invariant belief model that avoids sure loss is the constant 
prevision on y. Consequently, if we consider a monoid ^ that includes two different 
constant transformations, any belief model that is strongly ,5^-invariant incurs a sure loss. 

As a result, we see that there are monoids 3^ for which there are no strongly invariant 
coherent (lower) previsions. Under which conditions, then, are there strongly 5^-invariant 
coherent (lower) previsions? It seems easiest, and yields most insight, if we look at this 
problem in terms of sets of almost-desirable gambles: indeed if we consider a coherent 
lower prevision P on ^{^), then it is strongly =f5^-invariant if and only if for its associated 
set of almost-desirable gambles % = {/ e if (^) : P{f) > 0} we have that C 
We can consider ^,5? itself as a set of almost-desirable gambles, but at this point, we do 
not know whether !^_-^ is coherent, or whether it even avoids sure loss. Interestingly, the 
set of coherent previsions that is associated with ^,57 is given by 



So j^i^a^^ is precisely the convex and weak*-closed set of all 5^-invariant coherent 
previsions, and P_ is strongly ^fJ'-invariant if and only if ^(P) C ^{S>£r), or in other 



3^. 



a contradiction. 



□ 



J^i&s) - e P(^): (Vg e @j){P{g) > 0)} 

= {Pe P(S-) : (V/ G ^(^))(vr e ^)(p(/) = P(r7))} . 



^ See f Walle t [T991I Section 9.5] for related comments about the difference between permutability and ex- 
changeability. These notions will be briefly discussed in Section l9!2l 
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words, if and only if all coherent previsions that dominate £ are .5^-invariant. So there are 
strongly 5^-invariant coherent lower previsions if and only if ^{Sij) ^ 0, i.e., if there 
are <5^-invariant coherent previsions, and in this case the lower envelope of ^(Qij) is the 
point-wise smallest strongly 5^-invariant coherent lower prevision. 

In summary, we see that there are ,5^-invariant coherent previsions if and only if the set 
of almost-desirable gambles avoids sure lossEH which, taking into account (IM-ASLK 
is equivalent to the conditiorO 

n 

sup ^ {fk - T'Jk] > for all n > 0, /i, . . . , /„ in ^( Jf) and Tj, . . . , T„ in (2) 

k=l 

In that case, the natural extension S'^^ :~ S'^^ of to a coherent set of almost-desirable 
gambles is given b}13 

'^^= n |/e^(>^):/-e> E [fk-n/k] for some «>0,/,ei^(^),r,e^l 

£>0 I k=l J 

(3) 

This is the smallest coherent and strongly =fF-invariant set of almost-desirable gambles, 
or in other words, the belief model that represents evidence of symmetry involving the 
monoid The corresponding lower prevision, defined b\F^ 

(/) = min {Pif):Pe^{^,^)} (4) 

= max{iJ.eR: f-lJ-eS".^} (5) 

is then, by virtue of Eq. (|4|i [see also TheoremfTOlfurther on], the point- wise smallest (most 
conservative) strongly ^-invariant coherent lower prevision onJ!f{^), and if we combine 
Eqs. (O and Q, we find tha{3 



=sup|inf 



f-L[fk~nf,] 

k=l 



n>QJk^^{^')Jk^.j\. (6) 



Remember that this lower prevision is only well-defined (assumes finite real values) when- 
ever the condition (|2]l is satisfied. Taking into account Theorem [TO] further on, we de- 
duce that a coherent (lower) prevision is (strongly) 5^-invariant if and only if it dom- 
inates E_6^. Also, E_.^ is the belief model we should use if nothing else but the evi- 
dence of symmetry is given. Finally, this formula for the lower prevision is constructive, 
but usually the existence of invariant previsions (on infini te spaces) is proven in a non- 
constructive (Hahn-Banach) way; see S ection [51 and also A anew and Morse 1 1938 ll and 



Bhaskara Rao and Bhaskara Raol 1119831 Section 2.1.3(8)]. So we cannot usually get to 
the coherent invariant previsions by construction, but we can always construct their lower 
envelope explicitly! 



Also see IWallevl 's [ 199lL Lemma 3.3.2] Separation Lemma. 
"'^Observe that the set 2^ is a cone, i .e., closed und er scalar multiplication with non-negative real numbers. 

The same condition was derived bv lWallevI Il99ll Theorem 3.5.2 and Corollary 3.5.4] usmg an argument 
that works directly with coherent lower previsions. Although our argument strongly plays on the connection 
between the three equivalent types of belief models of Table[T] we believe that it produces more insight, once this 
connection is fully understood. 

Again, observe that S>,y is a cone. 
^^It is easy to see that ^ = J({Sy). 

Again, IWallevlll99lL Theorem 3.5.2 and Corollary 3.5.4] proves the same result in a different manner, see 
also footnote [33] 
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We shall have much more to say about the existence of strongly invariant belief models 
in Section |7] where we show that this existence is guaranteed in particular if the monoid 
!^ is Abelian, or if it is a finite group. The following counterexample tells us that there is 
no such guarantee for infinite groups. 

Example 1 (Permutation invariance on the natural numbers). Consider the set of all 
permutations of the set of natural numbers N. We show that there are no (strongly) 3^^- 
invariant coherent (lower) previsions on ^(N) by showing that the condition doesn't 
hold. Indeed, consider the partition of N made up of the sets 

/;5 = {3« + r: neN}, r = 0,l,2, 

and any permutations tt^ for r = 0, 1,2 such that for all n G N, 7rr(n) G ^3 if and only if 
n ^ /?3 [for instance, let tt, be involutive and such that it assigns the first element of R\ to the 
first of (^3)' , the second element of to the second of [R'-^Y , etc.] Consider the gamble 
G — Lr=ofe5 ~ '^r^ff^] oil '^h^ii ^1"^ done if we can show that supG < 0. Indeed, if 
n(^R\ then Gin) = 1 +0 + - (1 + 1 +0) = -1 for r = 0,1,2, so supG=-l. ♦ 

These results expose another fundamental difference between weak and strong invari- 
ance: while strong invariance with respect to a greater number of transformations means 
that we must refine our beliefs (i.e, it make them more precise), this is not the case with 
weak invariance. 

On the other hand, strong invariance is preserved by dominating lower previsions: if Px 
is a coherent lower prevision that is strongly ^-invariant and £2 is a coherent lower pre- 
vision that dominates Pj , then £2 is also strongly 5^-invariant. It indeed seems reasonable 
that, if a subject has evidence of symmetry, and she has some additional information that 
allows her to make her judgements more precise, she can add assessments while still pre- 
serving strong invariance. But a similar result does not hold for weak invariance: since the 
vacuous lower prevision is weakly -invariant, this would mean that any lower prevision 
should be weakly -invariant, quodnon. 

In summary, there is an important conceptual difference between weak and strong in- 
variance. Weakly invariant belief models capture in particular that a subject has no reason 
to strictly prefer a gamble / to its transformation T'/ whenever / / T'/. Strong invari- 
ance captures that a subject has reason not to distinguish between, i.e., to be indifferent 
between, the gambles / and T'/. And it is only if you insist on using Bayesian belief 
models always that you must infer indifference from having no reason to (strictly) prefer 
This is of particular relevance for belief models that try to represent a subject's complete 
ignorance, as we now proceed to show. 



5. Modelling complete ignorance 

Suppose our subject is completely ignorant about the value that X assumes in . Then 
she has no relevant information that would allow her to favour one possible value of X 
over another This implies that the corresponding belief model should be symmetric in the 
possible values of X, or in other words it should be weakl y invari an t with respect to the 



group S^sc of all permutations of 2^ . This leads to a form of lWallevf s 1119911 Section 5.5. 1] 
Symmetry Principle. 

Symmetry Principle (SP). If a subject is completely ignorant about the value of a random 
variable X in 2^ , then her corresponding belief model should be weakly invariant with 
respect to the group x of all permutations of . 
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We have mentioned before that the appropriate belief model for complete ignorance about 
X seems to be the vacuous lower prevision £jr. But SP by itself is not sufficient to single 
out this lower prevision: if, for instance, is finite, then the uniform precise prevision 
P", given by 

for each gamble / on which assigns equal probability mass 1/|^| to each element 
of is also weakly permutation invariant. We shall also see in Examples |5] and |6] of 
Section|9]that there may be many more coherent lower previsions that share the same weak 
permutation invariance property. If, however, we strengthen the Symmetry Principle to 
require weak invariance with respect to all transformations, and not just all permutations, 
then TheoremlH tells us that the vacuous lower prevision is indeed the only coherent 
lower prevision that is compatible with the following 

Strong Symmetry Principle (SSP). If a subject is completely ignorant about the value 
of a random variable X in 2^ , then her corresponding belief model should be weakly 
invariant with respect to the monoid S/'s; of all transformations of S(y . 

Wallevl lfl99ll Section 5.5 1 and note 7 on p. 526] has shown that for random variables 



X taking values in a finite set j?r, the vacuous lower prevision Pj^ is the only coherent 
lower prevision that is compatible with SP and the so-callecFl 

Embedding Principle (EP). Consider a random variable X, and consider a set of pos- 
sible values A for X. Then the (lower) probability assigned to the event A, i.e., the lower 
probability that X G A, should not depend on the set 2y of all possible values for X in 
which A is embedded. 

So under coherence, SSP is equivalent to SP and EP taken together. Under coherence, it is 
also equivalent to the following rationality principle, as we shall shortly see. 

Revised Principle of Insufficient Reason (RPIR). If you have two different gambles f 
and g on a random variable X that you are completely ignorant about, then if f ^ g you 
have no reason to prefer f to g. 

Indeed, the only coherent belief model that is compatible with this principle, is the vacuous 
one. We shall argue in terms of real desirability model^^ (see Section IZTT i. Say that a 
subject (really) prefers f to g whenever f ^ g and / — g G i.e., she accepts to exchange 
g for /. Then RPIR implies that for all / 7^ 0, / ^ implies that / or equivalently, 
by contraposition, that / S ^ implies / > 0. Hence = is the vacuous belief model. 

In summary, we have the following equivalences, under coherence, and the only be- 
lief model that is compatible with these three equivalent rationality requirements, is the 
vacuous one: 

SSP ^ SP+EP ^ RPIR. 
RPIR is a revised version of the Principle of Insufficient Reason (PIR), which states that 
if you are completely ignorant about the value of a random variable X, then you have no 
reason to distinguish between the different possible values, and therefore should consider 
all these values to have equal probability. Indeed, from a historical point of view, the PIR 



^^For additional discussion of this principle, see also lWallevI Il996bll . lwallev and Bemar j Il999ll . 

•^^ A simila r argum ent can be given for almost-desirability models S and lower previsions P, using for pref- 
erence IWallevl 's 1199 iL Sections 3.7.7-3.7.9] corresponding notion of strict preference, which coiresponds to the 
present argument by using &p U {0} as a coherent set of really desirable gambles. 
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was used extensively by Laplace (see for instance iHowid 11200211 ') to justify using a uniform 
probability for modelling complete ignorance. 

We are of course aware that our reformulation RPIR of Laplace's PIR is quite unusual 
and has little or no historical grounds, which is why we refer to it as a revised, or perhaps 
better, improved principle. It might have been preferable to call RPIR the 'Principle of 
Insufficient Reason to Prefer', but we decided against that for aesthetical reasons. 

We think that RPIR is reasonable, but that PIR isn't. Indeed, one of the reasons for the 
critical attitudes of many researchers towards 'Bayesian methods' and inverse probability 
in the nineteenth and early twentieth century seem to lie in the indiscriminate use by many 
of Laplace's PIR in order to obtain uniform prior probabilities that can be plugged into 
Bayes's formula^ And by 'indiscriminate use' we mean precisely the confusion that 
exists between symmetry of evidence and evidence of symmetry: we have argued that it 
is only evidence of symmetry that justifies using strongly invariant belief models (and in 
many cases, such as permutation invariance for finite spaces, strong invariance singles out 
the uniform probability as the only compatible belief model, see also Section|9]l. If there 
is only symmetry of evidence, we should use weakly invariant belief models, and in the 
special case of complete ignorance, vacuous ones. Of course, as we said in the Introduction 
and proved in the previous section, for precise previsions (Bayesian belief models) there 
is no difference between weak and strong invariance, so if you insist on using a Bayesian 
belief model, symmetry of evidence leads you to a (strongly) invariant one! The problem 
with the PIR, therefore, is that the belief model is only allowed to be precise: there would 
be fewer or no difficulties if in its formulation we just replaced 'probability' with 'lower 
and upper probability', for instance. 

6. Weakly invariant lower previsions 

Let us now turn to a more involved mathematical study of the invariance of coherent 
lower previsions. So far, we have only looked at coherent lower previsions that were 
defined on all gambles. But of course, it will usually happen that our subject specifies a 
supremum acceptable buying price P{f) for only a limited number of gambles /, say those 
in a subset ^ of ^{^). And then we can ask ourselves whether such an assessment 
can be coherently extended to a weakly, or to a strongly, .^J'-invariant lower prevision 
on all gambles. We shall address these, and related, problems in this and the following 
section. Let us begin here with weak invariance. The following definition generalises 
the already established notion of weak invariance to lower previsions defined on any 5^- 
invariant domain, that are not necessarily coherent (they may even incur a sure loss)Cj 

Definition 4 (Weak invariance). A lower prevision P defined on a set of gambles ^ C 
^{^) is called weakly -invariant if 

(Wl) T'f G je for all / in ^ and T in ^, i.e., je is ^-invai'iant; 

(W2) P{T'f ) > P{f) for all / in JT and T in ^, i.e., all TP point-wise dominate P. 

As before, if ^ is right-cancellable (and in particular if it is a group), the inequality in 
the invariance definition is actually an equality: consider a gamble / in J^, a transforma- 
tion T in £r and its right-inverse R, we have P{f) = PiiTRff) = P{R'{T'f)) > P{T'f) 
in addition to P{T' f) > P{f). 



An interesting historical discussion of sucli attitudes can be found in iHowid I2OO2II and lZabelll Il989bll . 
'^''Our notion of weak invariance for a lower prevision is weaker than lWalie\l 's ll99lL Section 3.5.1] corre- 
sponding notion of invariance, which requires equality, and has the drawback that it is not preserved by natural 
extension. 
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Next, because taking convex combinations, lower envelopes, limits inferior and superior 
preserves inequalities, it is easy to see that convex combinations, lower envelopes and 
point-wise limits of weakly invariant lower previsions are also weakly invariant. Observe 
by the way that the same operations also preserve coherence. 

The following proposition looks at weak invariance for (precise) previsions. 

Proposition 7. Let P be a prevision, i.e., a self-conjugate lower prevision, defined on a 
negation-invariant domain J(f = Assume that J(f is also -invariant. Then P is 

weakly ^ -invariant if and only ifP{T'f) = P{f) for all T in ^ and all f in ^ . 

Proof. It is clear that the condition is sufficient. To show that it is also necessary, assume 
that P is 5^-invariant, and consider any T m 3" and any gamble / in ^ . Then it fol- 
lows from the ^-invariance of P that on the one hand P{T^ f) > P{f), and on the other 
hand, since ~f€J(r and T'{-f) = ~T'f £ JT, thatP(-r/) =P(r'(-/)) > P{-f), or 
equivalently, using the self-conjugacy of P, that P{f) > P{T'f). □ 

We study next whether a weakly invariant lower prevision P with domain J(f can be 
extended to a coherent weakly invariant lower prevision on the set of all gambles, or more 
generally, whether there is a coherent weakly invariant lower prevision on all gambles 
that dominates P. We already know from the material in Section 12.31 that a necessary 
condition for this is that P should avoid sure loss. Indeed, if P incurs sure loss then it has 
no dominating coherent lower prevision, let alone a weakly invariant one. The perhaps 
surprising result we prove next is that avoiding sure loss is also sufficient, and that all 
we have to do is consider the natural extension E^p of P, as it preserves weak invariance. 
This natural extension is automatically guaranteed to be the point-wise smallest weakly 
5^-invariant coherent lower prevision that dominates Ppl 

Tlieorem 8 (Natural extension preserves weak invariance). The natural extension E_p of a 
weakly -invariant lower prevision Pon a set of gambles that avoids sure loss is still 
weakly 3" -invariant, i.e., for all gambles f on ^ and all T in 3, 



Consequently, E_p is the point-wise smallest weakly 3 -invariant coherent lower prevision 
on I£ that dominates P_ on its domain ,J€ . 

Proof. Consider any gamble f on and any T in 3. From the definition ( ILPR-NEl ) of 
natural extension, and the fact that T' ,Jf C ,J^, we get 



TEp{f)^Ep{rf)>Ep{f). 





(7) 



Now it follows from the r-invariance of P that P{T^gk) > Eigk)^ whence 



n n 



L h [T'gk PiT'gk)] <rY^h [g, p{gt)] , 




41. 



This result is mentioned, with only a hint at the proof, bv lWallevI ll99lL Theorem 3.5.2]. 
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and consequently f—a> Y!l=i ^k[8k~ E.{8k)] implies that 

r'/ - « > r' f h [8k-P{8k)] > E A, [T'g.-PiT'g,]] . 

k=l k=l 

So we may infer from the inequality (|7]i that 

Ep{T'f) > sup la: f^a>j^Xk[gk- P{8k)] \ - Kpif), 
4>o,aGjr [ k=i J 

k=l...,n,n>0, 

which completes the proof. □ 

Hence, if we start out with a lower prevision P on ^ that is weakly 5^-invariant and 
already coherent, then its natural extension E_p is the smallest coherent and weakly 
invariant lower prevision on all gambles that agrees with P on J{f. As we shall show 
further on, this result does not carry over to strong invariance. 

7. Strongly invariant lower previsions 

We now turn to the study of strong invariance for lower previsions on general domains. 

7.1. Definition and immediate properties. The following definition generalises the no- 
tion of strong invariance introduced in Section |4!2l to lower previsions that needn't be 
coherent, nor defined on all of ^(^). 

Definition 5 (Strong invariance). A lower prevision P defined on a set of gambles J(f C 
.jSf(<^) is called strongly ^-invariant if 

(51) r7-/e^and/-r7e^for all/in JT and alire ^; 

(52) P{T'f - /) > and P{f - T'f) > for all / in ^ and all T € 3". 

As is the case for weak invariance, it is easy to see that strong 5^-invariance is preserved 
under convex combinations, lower envelopes, and point-wise limits, simply because all 
these operations preserve inequalities. 

Proposition 9. A strongly 3^ -invariant coherent lower prevision on a S/" -invariant domain 
is also weakly 3^ -invariant. 

Proof. First of all, the coherence and strong invariance of P imply that < P{T'f — /) < 
P(T'f) - P{f), whence P{T'f) > P{f) and similarly, we derive from P(/ - T'f) > that 
P(/) > P{T'f). So we see that P is also weakly 5^-invariant (with equaUty). □ 

To see that a converse result does not generally hold, so weak invariance is actually weaker 
than strong invariance, consider the vacuous lower prevision P^ on ^(^) and the trans- 
formation Tfg that maps all elements x of j?r to xq. Then, for any gamble / such that 
inf/ < /(xq) we have P_af-;{f — T'^^f) < 0. Hence, P^- is not strongly T^rg -invariant but 
TheoremlH implies that it is weakly r^o -invariant. If we consider a finite space ^ and the 
vacuous lower prevision P j^- on ^{^') and the class of all permutations of we 
can see that weak invariance (with equality) does not imply strong invariance. 

So weak invariance is indeed a weaker notion than strong invariance. The following the- 
orem expresses the main difference between these two concepts: while the former means 
that the set of coherent previsions ^(P) is invariant, the latter means that every element 
of this set is invariant. 

Tlieorem 10. Let .J(f be a negation invariant and -invariant set of gambles such that 
T' f — f is in J(f for all f in J(f and T in ^ . 
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1. A coherent prevision P on J(f is weakly ^ -invariant if and only if it is strongly - 
invariant. In either case we simply call it ^-invariant. 

2. A coherent lower prevision P on J^T is strongly £^ -invariant if and only if all its domi- 
nating coherent previsions are (strongly) £^ -invariant on J(f. 

Proof. We Start with the first statement. We only need to prove the direct implication, so 
assume that P is weakly ^-invariant, and consider any / in . Then from the assumption 
and Proposition|2]we get P{T'f) = P{f), and it follows from the linearity of P that indeed 
PiT'f-f)=Pif-T'f)=0. 

We now turn to the second statement. Since any coherent lower prevision is the lower 
envelope of its dominating coherent previsions, the converse implications follow at once, 
since taking a lower envelope preserves strong invariance. To prove the direct implication, 
assume that P is strongly i^'-invariant, and consider any coherent prevision P in ^(P). 
For any T in .^'^ and any / in J(f we then find that 

0<P{f- T'f) < P{f - T'f) = -P{T'f - f) < -P{T'f - f) < 0, 

whence indeed P{f) = P{T'f ). □ 

7.2. Strongly invariant natural extension. We have shown when studying weak invari- 
ance that for any weakly ^f^'-invariant lower prevision P_ on some domain Jt' that avoids 
sure loss, there is a point-wise smallest weakly invariant coherent lower prevision defined 
on all gambles that dominates it: its natural extension E^p. Let us now investigate whether 
something similar can be done for the notion of strong invariance. The question then is: 
Consider a monoid of transformations of and a lower prevision P on that avoids 
sure loss, are there strongly ^-invariant coherent lower previsions on all ^(^) that dom- 
inate P_, and if so, what is the point-wise smallest such lower prevision? Let us denote, as 
before, by 

%={/e^(^):££(/)>0} 
the set of almost-desirable gambles associated with P, and by 

J^{P) = {P e P(5r): (V/ e je){P{f) > p(/))} 

its set of dominating coherent previsions, then clearly a coherent lower prevision Q on 
if( JT) is strongly =3^-invai-iantanddominatesPif andonly if ^(g) C ^(P) n.^(^,5?), 
or equivalently, S^pU C f^g. So there are strongly ,5^-invariant coherent (lower) pre- 
visions that dominate P if and only if (P) n ^(Si^) ^ 0, or equivalently, if the set of 
almost-desirable gambles Sl-p\^'3}:J avoids sure loss, and in this case the lower envelope of 
^(P) 0^(^,57), or equivalently, the lower prevision associated with the natural exten- 
sion of the set of almost-desirable gambles U i^^, is the smallest such lower prevision. 
In the language of coherent lower previsions, this leads to the following theorem0 

Theorem 11 (Strongly invariant natural extension). Consider a lower prevision P on Jif 
that avoids sure loss, and a monoid of transformations of 3^ . Then there are strongly 
1^ -invariant coherent {lower) previsions on ^{^) that dominate P on if and only if 

Ep ( E [A - Tlfk] ) > for all « > 0, /i, . . . , /„ in if ( JT) and Ti,...,T„ in ^, (8) 



I991I Theorems 3.5.2 and 3.5.3] proves similar results involving Eqs. Js) and jlOt for what we call 
weakly ^'-invariant P that avoid sure loss, in a different manner. See also footnotes |33| and |36l 
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or equivalently, if 

E.T L [fk ~ P{fi)\^ > for all n>0, and fu f„ in JT. (9) 

In that case the smallest coherent and strongly T -invariant lower prevision on ^{^) that 
dominates P on its domain is given by 



Ep^,^[f) = sup i £p / - £ [fk - rjk] : n > e if(^-),r, e ^ 



k=\ 



(10) 



sup<^£^ /-£A^[/i-£(A)] : n>0,AeJr,A^>0 



k=\ 



(11) 



for all gambles f on ^ ; and ./# {E_p is the set of all £^ -invariant coherent previsions 
that dominate P on J^. 

Proof. We already know that there is a dominating coherent (lower) prevision if and only 
if ^{P) r\^{S^_^) is non-empty. Let us show that this is equivalent to the conditions dHJ 
and (|9|l. To see the equivalence between these two conditions, it suffices to notice [use 
Eq. ( ILPR-NEI ) and the fact that Ep{h) — ~Ep{-h)] that condition © is equivalent to 



sup '£[fk-Tlfk] + Y.[gj-P{gj)] >0 
k=i j=i 

for all «,m > 0, fk G ^( J"), n e ST, gj e Jf, (12) 

and that this is in turn [use Eq. ^ and the fact that Ej{h) = —E_^{—h)] equivalent to 
condition (|9]l. But, considering condition ( IM-ASLI i. we see that condition (fTSl i holds if and 
only if the set of almost-desirable gambles S^p U !^,^ avoids sure loss, or equivalently, if 
the corresponding set of coherent previsions .4^ (P) r\^{S^_^) is non-empty. 

We now prove the validity of the expression (fTTT i for the lower envelope E^p ^ of the set 
of coherent previsions (P) n^(^j7). The proof for the expression ( fTol ) is analogous. 
We know from the material in Section|2]that this lower envelope is also the coherent lower 
prevision associated with the natural extension of the set of almost-desirable gambles '2lp\J 
<2ig', SO we get by applying Eq. (ILPR-NEI i with ^ = ^^U 'S^r that 



Ip.Af) = sup 

yt=I,...,n,n>0 



sup 

Ar=I,...,n,n>0 



sup 

e=l,...,m,m>0 



inf 



sup inf 

i=\,....m,m>0 



k=i e=i 

I n \ m 



k=l 



1=1 



sup 

;it>o,gi.G% 

k=l,...MM>0 



E,7[f-I, hgk )= sup / - ^ Xk [fk - Eifk) 



k=\ 



k=\ «,«>0 



k=\ 



for every gamble / on also taking into account the definition Q of E_ 



□ 



In conclusion, whenever the equivalent conditions ([8]l and (|9]l are satisfied for a lower 
prevision P that avoids sure loss, then (and only then) the functional E^p defined by 
Eqs. ( [Tol l and (fTTT) . is the point-wise smallest coherent and strongly 5^-invariant lower 
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prevision that dominates P. We shall call E_p g- the strongly -invariant natural extension 
of P_, as it is the belief model that the assessments captured in P lead to if in addition a 
(so-called structuralj3 assessment of symmetry involving the monoid ^ is made. 

7.3. The existence of strongly invariant coherent (lower) previsions. There is a beau- 
tiful and surprisingly simple argument to show that for some types of monoids there 
always are strongly j5^-invariant lower previsions that dominate a given lower prevision 
that is weakly 5^-invariant and avoids sure loss . It is b ased on the combination of a number 



of ideas in the literature: (D lAgnew and Morsd 1119381 Section 2] constructed some specific 



type of Minkowski functional and used this together with a Hahn-Banach extension result 
to prove the existenc e of linear f unctionals that are invariant with respect to certain groups 



of permutations; (ii) Davl 1119421 Theorem 3] showed, in a discussion of ergodic theorems, 



that a similar construction alwa ys work s for A belian semigroups of transformations; (iii) 



with crucially important insight. IWalleyl 111991, Theorems 3.5.2 and 3.5 .31 re cogn ised that 



the Minkowski functional in the existence proofs of Agnew and Morsd and lDay 



IS actu- 



ally what we have called a strongly invariant lower prevision, and he used the ideas behind 
this construction to introduce what we shall call mixture lo wer previsions in S ection |74| 



(iv) in another seminal discussion of mean ergodic theorems. lAlaoglu and Birkhoff |i ,194i 
show that (Moore-Smith-like) convergence of co nvex mixtures of linear transforma t ions is 
instrumental in characterising ergodicity; and (v) ^ Bhaskara Rao and Bhaskara Raoli 19831 



Section 2.1.3] use so-called Banach limits to generate shift-invariant probability charges. 
In this and the next section, we combine and extend these ideas to prove more general ex- 
istence results for (strongly) invariant coherent (lower) previsions, and to investigate their 
relation to (generalised) Banach limits (Section[8l). As we shall see in Section lT^liWalleyi 's 



1 19911 Section 3.5] results can then be derived from our more general treatment. 

Consider a monoid ,'7 of transformations of ^ . We can, as before, consider the set of 
lifted transformations ,'7' as a monoid of linear transformations of the linear space ^{S','). 
A convex combination T* of elements of is a linear transformation of of the 

form 

k=\ 

where « > 1, Ai, . . . , A„ are non- negative real numbers that sum to one, and of course 
T* f = Li=i ^kTlf- We denote by 3/^* the set of all convex combinations of elements of 
.T. We have of course for any two elements = hUl and T* = ^^^j ^^Vl of 3^* 
that their composition 

« I m \ n m n m 

k=\ \i=\ / k=ll=\ k=ll=\ 

again belongs to ,'7* . This implies that ,'7* is a monoid of linear transformations of ^(^) 
as well. We can now introduce invariance definitions involving transformations in ,7* in 
precisely the same way as we defined them for 5^ (or actually ,7^). We can also define, 
for any real functional A and T* e 5^*, the transformed functional T* A as AoT*. We then 
have the following result. 

Proposition 12. The following statements hold, where f is a gamble on 2^ , Ji^ is a convex 
set of gambles on 3^ , and P_ is a coherent lower prevision on Jtf : 
1. f is 3^ -invariant if and only if f is -S^* -invariant; 



^■'structural assessments are discussed in general in lWallevI ll99lL Chapter 9]. 
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2. ,yif is ^7 -invariant if and only if ,X is -invariant; 

3. P is weakly ^-invariant if and only ifP is weakly 3^* -invariant; 

4. P is strongly -invariant if and only ifP is strongly -invariant. 

Proof. It suffices of course to prove the direct implications. Consider an arbitrary T* = 
HkhTk e 3^*. For the first statement, let / be J^-invariant, then T* f = Y^khT^f = 
Y.k^kf — where the second equality follows from the 5^-invariance of /. So / is 
^*-invariant. For the second statement, let ^ be =3^-invariant and let / G then 
T*f = Hk^kTlf S because Tlf G for all k by the ^-invariance of .X' and be- 
cause is convex. So is 5^*-invariant. For the third statement, assume that P_ is 
weakly 5^-invariant. For any / G ,J^, 

p{T*f) - p ( EA,r// 1 > Y^mnf) > L^kPif) - pif), 

\ k / k k 

where the first inequality follows from the coherence of P_, and the second from the weak 
=5^-invariance of P_. Hence P is weakly 5^*-invariant. For the last statement, assume that P_ 
is strongly J^'-invariant. For any / G ,J^, 

p(Y.^kTif-.f^ =p{Y^h{nf-f)^ >^mTif-f)>o, 

where the first inequality follows from the coherence of P, and the second from the strong 
^-invariance of P. Similarly P{f — Y.k ^kT[f) > 0. Hence P is strongly 5^* -invariant. □ 

We now define the following binary relation ^ on 5^*: for Tj* and T2 in ^* we say that 
T2 is a successor of Tj*, and we write T2 ^ T^, if and only if there is some T* in such 
that 7^2* = T*T^ . Clearly ^ is a reflexive and transitive relation, because ,'^* is a monoid. 
We say that ,'7* has the Moore-Smith property, or is directed by if any two elements of 
,'7* have a common successor, i.e., for any and Tj* in ,"7* there is some T* in ,7* such 
that T* ^ Tj* and T* ^ Tj*. It is not difficult to see that if .7 is Abelian, or a finite group, 
then is directed by the successor relation. This need not hold if ,7 is an infinite group 
or a finite monoid, however 

Now, given a net a on ,7*, i.e., a mapping a : ^* R, we can take the Mo ore-Smith 



limit of a with respect to the directed set (^*,^) II Moore and Smithl 1 19221 Section I, 



p. 103], which, if it exists, is uniquely defined as the real number a such that, for every 
e > 0, there is a T* in such that |a(r*) - a] < e for all T* ^ T*. The Moore-Smith 
limit fl of a is denoted by limT*^,^* a{T*). This limit always exists if a is non-decreasing 
and bounded from above, or if a is non-increasing and bounded from below. 

Theorem 13. Let P be a coherent and weakly 3^ -invariant lower prevision on [2^), and 
assume that ^7* has the Moore-Smith property. Then for any gamble f on 2y the Moore- 
Smith limit \\vs\j*^y* PfyT* f) converges to a real number ^(/). Moreover, ^ is the 
point-wise smallest strongly .T' -invariant coherent lower prevision on that domi- 

nates P on and 




e^_^(/) = sup{£(r/): r G^*} = sup<;p| -^r//) :«>l,ri,...,r„G^ 



(13) 
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Proof. First, fix / in if ( j: ). Consider and in 3^*, and assume that ^ . This 
means that there is some T* in such that T,* = T*Ty, and consequently we find that 

P{nf)=P{T*{nf))>P{nf), 

where the inequality follows from the fact that P_ is in particular weakly r*-invariant [ob- 
serve that ^(^) is convex and that P_ is weakly 5^-invariant, and apply Proposition [T2l . 
This means that the net P{T*f), T* e ^* is non-decreasing. Since this net is moreover 
bounded from above [by sup/, since P is coherent], it converges to a real number ^ (/), 
and clearly 

Qp^,^if) - ,ar7) - sup{£(r7) : T* e ,^*} . (14) 

This tells us that the net of coherent lower previsions T*P, T* G 5^* converges point- 
wise to the lower prevision ^, so ^ is a coherent lower prevision as well [taking 
a point- wise limit preserves coherence]. Since id'^ G it follows from Eq. ( fT4l i that 
Qp ^y-{f ) > £(id'r /) = P-if)^ so Qp ^ dominates £ on ^( JT). We now show that Qp ^ 
is strongly 5^-invariant|3 Consider any / in ^( j?r) and T in 3" . Then for any n > 1, 
T^n •= \ YX=\ (r*)' belongs to 9'*, and it follows from the coherence of £ that 

P{T:{f-rf)) = -P{T'f-{T"+'yf) > -inf [r'/-(r«+i)'/] 

n n 

= --sup[(r"+i)'/-r'/] >--sup|/|, 

n n 

and consequently 

Qp^,^ if - T'f) > sup I - ^ supl/l : « > 1 1 = 0. 

A similar argument can be given for Qp ^{T'f — /) > 0, so Qp ^ is indeed strongly ST- 
invariant. 

Next, consider any strongly =f7-invariant and coherent lower prevision Q on ^{5(^), 
and assume that it dominates £. Then we get for any gamble / on ^ and any T* in 3"*: 

Q{f) = Q{f-T*f+T\f)>Q[f~T\f) + Q{T\f)>Q(J\f)>P{T^ 

where the first inequahty follows from the coherence of Q, the second inequality from its 
strong =5^-invariance [use Proposition[T2l. and the last inequahty from the fact that Q dom- 
inates £. We then deduce from Eq. ( fT4l l that Q dominates Qp So Qp ^ is indeed the 
point- wise smallest strongly ^-invariant coherent lower prevision on ^(^) that domi- 
nates £ on ^( J"). 

Finally, let us prove the second equality in Eq. ( fTSI ). Consider a gamble / and any 
£ > 0. Then, by Eq. (Qli, there is some T* in .7* such that Qp ^(/) < PjJ* f) + f . For 
this r*, there are « > 1 , Ti , . . . , r„ in 5^ and Ai , . . . , A„ > that sum to one, such that T* = 
Ylk=\ ^kTl- Let pi, . . . , p„ be non-negative rational numbers satisfying |p; — A,| < 2„g^ i^i 



^The idea for this part of the proof is due to lWallevI ll99lL Point (iv) of the proof of Theorem 3.5.3]. 
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such that moreover Y.'i'=i Pi = 10 Now it follows from the coherence of P that 
and also 

p (tiPi m'f] > t p((Pi ^i)T!f) > i inf(p- - m'f 



w=I / (=1 i=l 



n 




£ £ 

,tl 2nsup|/| 2 

whence 

Qp^,^{f)<P{T*f) + \<P 
and consequently 

Qp^^if) = sup |p 1^^^^ p,Tif^ : « > 1 , Ti , . . . , r„ e ^, pi , . . . , p„ G Q+, ^f; p,- - 1 1 , 

where Q+ denotes the set of non-negative rational numbers. Now, it is easy to see [just 
consider the least common multiple of the denominators of pi, . . . , p„] that this supremum 
coincides with the right-hand side of Eq. ( fT3] ). □ 

This result allows us to establish the following corollary. It gives a sufficient condi- 
tion for the existence of strongly 5^-invariant lower previsions dominating a given coher- 
ent lower prevision P. The smallest such lower prevision reflects how initial behavioural 
dispositions, reflected in are modified (strengthened) to E^p j when we add the extra 
assessment of strong invariance with respect to a monoid ^ of transformations. 

Corollary 14 (Strongly invariant natural extension). Let S' he a monoid of transforma- 
tions of and let P_he a weakly S/^ -invariant lower prevision on some set of gambles 
, that avoids sure loss. Assume that S^* has the Moore-Smith property. Then there are 
strongly -invariant coherent lower previsions on J^{^) that dominate P on ^{J^"), 
and the smallest such lower prevision, which is called the strongly c^T-invariant natural 
extension ofP, is given by E_p,^ = ^- Moreover, for every -invariant gamble f we 

have that E_p ^{f) = E_pif)- 

Proof. The first part of the proof follows at once from the observation that a coherent lower 
prevision Q on ^(^) dominates P on J(f if and only if it dominates E_p on all gambles. 
For the second part of the proof, simply observe that if / is a 5^-invariant gamble, then 
T*f = f and therefore Ep{T*f) = Ep{f) for all T* in □ 

Let us show in particular how this result applies when we consider the monoid 
generated by a single transformation T: 



To see that such rational numbers exist, it suffices to consider non-negative rational numbers pi , . . . , p„_i 
such that < p; < A; < 1 and |p; - A,| < , for r = 1,. . . 1, and to let p„ := 1 -^'Ci' Pi > 1 ^El'j/ ^i = 
A„ > 0. Then p„ £ [0, 1], and for n big enough, and unless we are in the trivial case where A, = 1 for some (', we 
get |p„-A„| < 
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Corollary 15. Let T be a transformation of and consider the Abelian monoid — 
{T" : n > 0}. Then for any weakly T -invariant lower prevision P on some set of gambles 
■J^ that avoids sure loss, there are strongly T -invariant coherent (lower) previsions on 
^{^) that dominate P, and the point-wise smallest such lower prevision E_p j is given by 

Proof. The existence of strongly T-invariant coherent (lower) previsions on ^{^) that 
dominate P follows from Corollary [141 and the fact that for any Abelian monoid 3^* 
has the Moore-Smith property. It also follows from this corollary that for any gamble / on 

E pjif) = sup {Ep{T*f): T* G .^j*} > sup£p {-"fiT^yf] . 



To prove the converse inequality, fix any T* in 3^* and any gamble / on Then there is 

-k=0 



some > 1 and non-negative Ao, . . . , ^n-i that sum to one, such that T* = Y^^^q Xk{T^)' 



Consider the element S*/^ — -^ L^o' {T'^J of 17*, where M is any natural number such that 
M >N. Observe that 

1 M-l /N-1 \ M-lN-l 1 M+N-2 

™ t=Q \k=0 ) e=0 k=0 ™ m=0 



where we let, for < m < M + A' - 2, 



E E = { ji ifN- 1 < m <M- 1 




\fM <m<M + N -2. 



This tells us that ^U,,, = for — 1 < m < M — 1, and < < -g for all other m. If we 
let 5m :— jim — A?+M- 1 ' it follows at once that 

A^- 1 

ifA^ - 1< m <M - 1 



\S„^\<)MiM + N-l) 

ifO<OT<A^-2orM<m<M + A^-2 

KM+N- 1 

Consequently, it follows from the weak ^-invariance and the coherence of E_p that 

EAT*f) 

<Ep{Sl,T*f) 

/ M+N-2 \ M+N-2 

= Ep[s*M+„_j+ 5,„(r")7 <££(W-i/)+ E I5™|sup|/I 



=0 / m=0 

<Kp[S*M+N_J) + SWp\f\ 



N-1 1 

(M - A'+ 1) + — — — (2A? - 2) 



-Ep{SM+N-l.f) + ^Mf\ 



M{M + N-iy ' M + N-l 

{N ~\){3M-N+\) 
M{M + N- 1) 
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Recall that / and T* , and therefore also are fixed. Consider any £ > 0, then there is some 
Me>N such that sup|/| < £ for all M > M^, whence 

Ep{T*f ) < ££(5^,+^_i/) + e < supEp{s:f) + e. 

Since this holds for all e > 0, we get Ep{T*f) < sup^y YE_p(S*f). Taking the supremum 
over all T* in 3* leads to the desired inequality. □ 

7.4. Mixture lower previsions. The condition established in Theorem[T3]is fairly general, 
and guarantees for instance the existence of ^-invariant coherent previsions whenever the 
monoid 3^ is Abelian, or a finite group. In case 3* is not directed, however, as may happen 
for instance for groups £^ that are not finite nor Abelian, there may still be 5^-invariant 
coherent previsions, as we shall see in Example |2] below. So we see that the directedness 
of 3* is not a necessary condition for the existence of .5^-invariant coherent previsions. 

But consider a weakly 5^-invariant lower prevision P defined on some domain J(^, that 
avoids sure loss. Even if 3* is not directed|3 we may still associate with P a lower 
prevision ^ on ^{^) through Eq. ( fTsT i: 

Qp„^ if) = ^sup ^ Ep{T*f) = sup jfip f r//^ : « > 1 , r, , . . . , t; e ^1 , 

where we have replaced the Moore-Smith limit by a supremum (with which it would coin- 
cide in case 3* were directed), and where E_p is the natural extension of P to all gambles. 
We shall call this lower prevision the mixture lower prevision associated with the weakly 
invariant P_. The supremum in this expression is finite, since it is dominated by sup/. This 
mixture lower prevision is not necessarily coherent, but it is still strongly 5^*-invariantQ 
Moreover, this mixture lower prevision dominates E_p, and therefore also P [observe that 
E^P is weakly invariant because P is]; and if there are ^-invariant coherent previsions, 
it is dominated by the strongly =f7-invariant natural extension E_p j of p|3 This shows 
that ^{Qp ^) = ^(E_p 5?), since all coherent previsions that dominate the strongly S^- 
invariant ^ are necessarily 5^-invariant. And clearly then, if this mixture lower previ- 
sion is coherent, it coincides with the strongly invariant natural extension. So we see that 
the mixture lower prevision, even if it is not coherent, still allows us to characterise all 
=$^-invariant coherent previsions. In particular, there are such invariant coherent previsions 
if and only if it avoids sure loss. 

Example 2 (Directedness is not necessary). Let us consider the space ^ := {1,2,3}, and 
let T\ and T2 be the transformations of 3C given by 7i(l) = 1, T\{2) = 2, Ti{3) — 2 and 
r2(l) = 1, 72(2) = 3, ^2(3) = 3, respectively. Since TiTj = Tu T2T2 = T2, T2T1 = T2 and 
T1T2 = Ti, we deduce that the set of transformations £^ = {idjf , , T2} is a monoid. Let 
be the coherent prevision on ^{^) given by ^'{i}(/) = /(I) for any gamble /, i.e., 
all of whose probability mass lies in I. Then we have P[i]{f) — P{i}{Tlf) = P[i]{T2f) for 



^^This is the general situation tliat lWallevI ll99lL Section 3.5] considers, and lie doesn't discuss the direct- 
edness of He does consider the special case that S' is Abelian for which he proves that the existence of 
invariant coherent previsions is guaranteed. The results in this section were first proven by him. 

^^Simply observe that the relevant part (near the end) of the proof of Theorem[T3]is not based on the direct- 
edness of ^* . 

To prove that the mixture lower prevision dominates P, consider T ' = id,f in its definition. To prove that 
it is dominated by the strongly invariant natural extension, take = f/n in the expression H0\ for this natural 
extension. 
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any gamble /, so is ^-invariant. Let us show that .'^* does not have the Moore-Smith 
property. 

Consider T* and T* in ST* given by T* =XT{^{\- X)T{ and 7^ = ^T/ + (1 - IL)T{, 
with X jJ.. Let T* be another element of 5^*, so there are non-negative ai, a2 and aj, 
such that ai + a2 + 0:3 = 1 and T* = ai id'^- +a2T[ + asTj. Now, 

r* r; = ai A id^^.- r/ + ai ( l - A ) id'^- 

+ a2XT{T[ + a2(l -X)T{T{ + a^lT^Tl + a3(l - A)rjr2' 

= aiAri' + ai(l -A)r2' + a2Ari' + a2(l -A)r2' + a3Ari' + a3(l -A)r2' 

= Ar/ + (i-A)r2' = ri*. 

Similarly, r*r2* = Tj* for any T* G This means that Tj* is the only possible successor 
of Tj*, and Tj* is the only possible successor of Tj*. Hence, ^* cannot have the Moore- 
Smith property. Nevertheless, there is a .^J'-invariant coherent prevision ^'{i}. 

Let us consider the vacuous, and therefore weakly 5^-invariant and coherent, lower 
prevision P 51^, on J^{^-i), and the mixture lower prevision Q ^ that corresponds with 

it. It is easy to show that for any gamble /, Q ^{f) = min{/(l),max{/(2), /(3)}} and 

this lower prevision avoids sure loss, and is therefore strongly ^-invariant, but it is not 
coherent [it is not super-additive]. It is easy to see that fjij is the only coherent prevision 
that dominates ^, and is therefore the only ^-invariant coherent prevision. ♦ 

7.5. Invariance and Choquet integration. Until now, we have explored the relation be- 
tween coherence and (weak or strong) invariance. To complete this section, we intend to 
explore this relation for the particular case of the n-monotone lower previsions and proba- 
bilities introduced near the end of Section l24l 

Consider an n-monotone lower probability P defined on a lattice of events containing 

and Then its natural extension to all events coincides with its inner set function P^, 
which is given by (A) = sup{P(B) : B e J^^,B C A}. Furthermore, the natural extension 
to all gambles is given by the Choquet integral with respect to P^: 

f /-sup/ 

Ep{f) = (C) / fdP, :=inf/+(/?) / P,({xe JT: f{x) > a})da 

- J.Sr Jinff 

for all gambles / on J^, where the integral on the right-hand side is a Riemann inte- 
gral. This natural extensio n (and therefore also the inner set function) is still n-monotone 

1 De Cooman et al. , 2005bllall . Since we have proven in Theorem [8] that natural extension 
preserves weak invariance, we can deduce that the inner set function of a n-monotone 
weakly invariant coherent lower probability, and the associated Choquet functional, are 
still weakly invariant, n-monotone and coherent. We now show that weak invariance of the 
inner set function and the associated Choquet integral is still guaranteed if the lower proba- 
bility P is not coherent or 2-monotone, but only monotone. In what follows, it is important 
to remember that for a transformation T of ^ and a subset A of T'Ia — It-^a)- 

Proposition 16. Let Pbe a weakly ■S/' -invariant monotone lower probability, defined on 
a -invariant lattice of events ,X that contains and 2y , and such that £(0) — and 
P{S^) = L Then 

1. the inner set function o/P is weakly 3^ -invariant; and 

2. the Choquet integral with respect to P^ is weakly ."^ -invariant. 

Proof. To prove the first statement, consider any A C and let B £ ,Jt^ be a any subset 
of A. Then for any T in ^, T-^{B) G Jf and T-\B) = {x: Tx e B} C {x: TxeA}^ 
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(C) //dn= inf /+(/?) / da 



T-^ (A), whence P{B) < P{T-^ (B)) <P^iT-\A)), where the first inequality follows from 
the weak invariance of P, and the second from the fact that is monotone and coincides 
with P on its domain, because P is assumed to be monotone. Consequently P„(A) = 
^^Pse.yt bcaE-W — E.*{T^^{A)). Hence, P^ is also weakly ^-invariant. 

To prove the second statement, let / be any gamble on Define, for any a in M, the 
level set /a :— {x: f{x) > a}. Then by the first statement, 

PAfa)<P,{T-Hfa))=PJix: TxGfa})=P,{{x: f{Tx) > a}) ^ ZM' f) a) ■ 
Hence, 

/-sup/ r 

< inf/+ {R) / P,((r7)„)da = (C) / T'fAP,, 

Jinff J 

also taking into account for the last equality that P^((r'/)Q;) = 1 for all a in [inf/,inf T'/), 
and that (/„ ) = for all a in (sup T'f, sup/] . □ 

As we said before, natural extension does not preserve strong invariance in general, 
and a simple example shows that this continues to hold in particular for n-monotone lower 
previsions: the unique coherent lower prevision defined on {0, ^} is trivially completely 
monotone and strongly invariant with respect to any monoid of transformations but its 
natural extension, the vacuous lower prevision P^ (which is completely monotone), is not 
strongly 5^-invariant unless in the trivial case that = {id ^*-}. 

It is nonetheless interesting that if we restrict ourselves to coherent previsions (which 
constitute a particular instance of completely monotone lower previsions), natural exten- 
sion from events to gambles does preserve strong invariance. This is a consequence of the 
following theorem. 

Theorem 17. Let Pbe a coherent lower prevision on S£{^S^) and let 3^ be a monoid of 
transformations on 2^ . Then P is strongly -invariant if and only if any P in ^(P), its 
restriction to events is (weakly) ,^ -invariant, in the sense that P{T^^{A)) = P{A) for all 
AC ^ and all T £ 

Proof. We start with the direct implication. If P is strongly =5^-invariant, then any P in 
^(P) is ^fJ'-invariant by Theorem [TOl Hence, given A C ^ and T G =5?^, we get P(A) — 
P(J-i(A)). 

Conversely, consider P in ^(P). Recall that a coherent prevision on all events has 
only one coherent extension from all events to all gam bles, namely its natural extension, 
or Choquet functional; see llPe Cooman et al. . 2005a I. So for any gamble / on ^ and 



any T in taking into account that P is assumed to be invariant on events, and that 
T^^ifa) — {T'f)a [see the proof of Proposition [T6l . we get 

Pif) = (C) / fdP = inf f+{R) P(/„)da 

/■sup/ , /sup/ 

= inf/+(P)/ P(r-i(/„))da = inf/+(P) / P((r7)„)da 

Jinf/ Jinf/ 

= (C) / T'fdP = P{T'f). 



Hence, P is strongly 5^-invariant and, applying Theorem[Tol so is the lower envelope P of 

□ 
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We see that, although the condition of strong invariance cannot be considered for lower 
probabilities, in the sense that Ia — T'Ia will not be in general the indicator of an event, it 
is still to some extent characterised by behaviour on events. Moreover, we may deduce the 
following result. 

Corollary 18. Let P be a strongly 3^ -invariant lower prevision on a ^ -invariant set of 
gambles J(f that includes all indicators of events. Assume that P avoids sure loss. Then its 
natural extension to all gambles is strongly £^ -invariant, and coincides therefore with the 
strongly invariant natural extension of P. 

Proof. Since P avoids sure loss, ^(P) is non-empty. Since P_ is strongly invariant on 
a domain that includes all events, any element P of ^{P) is (strongly) invariant on all 
events. Hence, by the previous theorem, P is also (strongly) invariant on all gambles, 
since a coherent prevision on all events has only one coherent extension from all events to 
all gambles (namely its natural extension, or Choquet functional). Therefore, the natural 
extension of £ is a lower envelope of invariant coherent previsions, and is therefore strongly 
invariant. □ 

This result provides further insight into the existence problem for strongly invariant co- 
herent lower previsions. The existence of strongly invariant coherent lower previsions on 
all gambles is equivalent to the existence of invariant coherent previsions on all gambles, 
which in turn is equivalent to the existence of invariant coherent previsions on all events 
(or in other words, invariant finitely additive probabilities). And it is the impossibility of 
satisfying invariance with finitely additive probabilities in some cases (for instance for the 
class of all transformations) that prevents the existence of coherent strongly invariant 
belief models. 

We also infer that if the restriction 2 of a coherent lower prevision P on J^{3^) to 
gambles of the type Ia — T'Ia and T'Ia — Ia, involving only indicators of events, is strongly 
invariant, then P is strongly invariant on all of ^(^): it will dominate the natural exten- 
sion E_Q of Q, which is strongly invariant by Corollary[T8] and consequently it will also be 
strongly invariant. 

We can also deduce the following result. Recall that a linear lattice of gambles J(f is 
a set of gambles that is at once a lattice of gambles and a linear subspace of ^(^). If 
in addition contains all constant gamble s, then for an y coherent prevision P defined 
on .Ji^, its natural extension to all gambles li Wallevi[l991 , Theorem 3.1.4] is given by the 



inner extension P*{f) :— sup {P{g) : g G J(^,g < /}. Let us denote by P* the conjugate 
upper prevision of f*. 

Corollary 19. Let ^ be a monoid of transformations of 3^ , and let P_be a strongly 
invariant lower prevision on a linear lattice of gambles ^ that contains all constant 
gambles. The natural extension E^, of P_ to all gambles is strongly -invariant if and 
only if for any coherent prevision P on that dominates P, we have f*(A \ T^' (A)) = 
P*{A\T-\A))=P^{T-\A)\A)=P*{T-'^{A)\A)forallA C 3^ andallT e ST. 

Proof. It follows from'Wallev' f 199l', Theorem 3.4.2] that E_p is the lower envelope of the 
coherent lower previsions f*, where P is any coherent prevision on Jif that dominates P_ 
on J?r. But then, clearly, E_p will be strongly ^-invariant if and only if all the f* are. 
Consider any such P*. By Theorem[T7] is strongly invariant if and only if for all A C ^ 
and r G ^: 

2(A) = Q(T-\A)) for all Q in ^(P*) 
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which is obviously equivalent to P, [Ia — T'Ia) — P* {T'Ia ~Ia) =0. Now observe that Ia — 
T'Ia=Ia -h-HA) = 4\r-i(A) ~h-^(A)\A^ and that the functions Ia\t-Ha) and -/7-1 (a)\a 
are comonotone. Since P is a coherent prevision on it is completely monotone. Hence, 
its inner extension is coherent and completel y monotone on all gambles, and therefore 



comonotone additive BDe Cooman et al.L l2005all . This means that 



P*{Ia - T'Ia) - P*{Ia\t-\a) '~h-^(A)\A) — P*i^A\T-^{A)) + P*i^h-^{A)\A) 

1\7-1(A)) ^'f*(^7-l(A)\A 



= P*ilA\T-HA)) - P*ilT-HA)\A) =P*iA\ {A))-P*{T-' (A) \ A) 



and similarly P^{T'Ia - h) = P*iT-\A) \ A) -P*{A\ T"' (A)). The rest of the proof is 
now immediate. □ 

8. Shift- iNVARiANCE and its generalisations 

8.1. Strongly shift-invariant coherent lower previsions on j2f{N). Let us consider, as 

an example, the case of the shift-invariant, i.e., 5^ -invariant, coherent previsions on Ji'(N). 

These are usually called Banach l imits in the literature, see for instance. iBhaskara Rao and Bhaskara Rao 



1119831 Section 2.1.3] or lWallevI 1119911 Sections 2.9.5 and 3.5.7]. We know from Corol- 
lary [14] that there are always Banach limits that dominate a given weakly shift-invariant 
lower prevision — so we know that there actually are Banach limits. Let us denote by 
Pg (N) the set of all Banach limits. We also know that a coherent lower prevision on ^{N) 
is strongly shift-invariant if and only if it is a lower envelope of such Banach limits. The 
smallest strongly shift-invariant coherent lower prevision E_q on ^(N) is the lower enve- 
lope of all Banach limits, and it is given by|3 

Eg{f)= sup inf if /(A: + m,)=lim inf i'^^Vw, (15) 

for any gamble / on N (or in other words, for any bounded sequence /(n)„GN of real num- 
bers). The first equality follows from Corollary [141 and the second from Corollary [15] 
Keif) is obtained by taking the infimum sample mean of / over 'moving windows' of 
length n, and then letting the window length n go to infinity. Since this is the lower pre- 
vision on ^(N) that can be derived solely using considerations of coherence and the evi- 
dence of shift-invariance, we believe that this E_g is a natural candidate for a 'uniform dis- 
tribution' on N. It is the belief model to use if we only have evidence of shift-invariance, 
as all other strongly shift-invariant coherent lower previsions will point-wise dominate E_q, 
and will therefore represent stronger behavioural dispositions than warranted by the mere 
evidence of shift-invariance 

We could also sample / over the set {!,..., n} leading to a coherent 'sampling' previ- 



sion 



"^^See also[miie3 Il99ll Section 3.5.7]. The expression on the right hand side is not a limit inferior! 
^''BuI this belief model has the important defect that, like the lower prevision Sg defined further on, it is not 



fully conglomerable; see IWalle\< I1991I Section 6.6.7] and observe that the counterexample that Walley gives 
for Sg, also applies to £g. Walley 's remark there that his example shows that there are no (what we call) fully 
conglomerable (strongly) shift-invariant (lower) previsions that dominate Sg , can be extended in a straightforward 
manner to Eg to show that there are no fully conglomerable (strongly) sfiift-invariant (lower) previsions. 
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but the problem here is that for any given / the sequence of sampling averages is not 
guaranteed to converge. Taking the limits inferior of such sequences (one for each gamble 
/), however, yields a coherent lower previsior0 Sq given by 




for any gamble / on N. For any event A C A^, or equivalently, any zero-one-valued se- 
quence, we have that S„{A) = -|A n {0, . . . ,« — 1}| is the 'relative frequency' of ones in 
the sequence /^(n) and 

Sq{A) = liminf5„(A) = liminf-|A n {0, . . . ,n - 1}|. 

Let So denote the conjugate of Sg, given by S0{f) — limsup„ 5'„(/). Those events A for 
which Sq{A) — 58(A) have a 'limiting relative frequency' equal to this common value. It 
is not difficult to show that the coherent 'limiting relative frequency' lower prevision Sg 
is actually also strongly shift-invariantQ This implies that all the coherent previsions that 
dominate Sg are strongly shift-invariant. But it is easy to see (see Example |3] below) that 
E_g is strictly dominated by Sg, so there are Banach limits that do not dominate Sg. 

Proposition 20. Let L be any Banach limit on ^(N), let f be any gamble on N. Then the 
following statements hold. 

1. liminf„^„/(«) < Egif) < Sgif) < Sgif) < Eg{f) < limsup„_/(«). 

2. Iflim„^„f{n) exists, then 

Eg{f)^Sg{f)=Eg{f)=Sg{f)=L{f) = lim/(«). 

n—foo 

3. If f is 0'^^ -invariant (has period m>\), then 

1 m— 1 

Eg{f) ^Sg{f)^Eg{f)=Sg{f)=m = - £ /W- 

r=\ 

4. If f is zero except in a finite number of elements of N, then E_g (/) = S_g (/) = Eg{f) = 

if) ~ E{f) = 0. In particular, this holds for the indicator of any finite subset A of N. 

Proof. We begin with the first statement. By conjugacy, we can concentrate on the lower 
previsions. We have already argued that Sg is a strongly shift-invariant coherent lower pre- 
vision, so Sg will dominate the smallest strongly shift-invariant coherent lower prevision 
E_g . So it remains to prove that E_g dominates the limit inferior. Consider the first equality 
in Eq. (fTSl l. Fix the natural numbers n > 1, nii, . . .m„. We can assume without loss of 
generality that the mi is the smallest of all the m(. Observe that 

inf - > fik + nif) > inf min f(k + mi) ~ min inf f(k) — inf f(k), 
k>0 n f^^ ^ ' ~ k>0 e=i ^ ' i=l k>me ^ ' k>mi ^ ' 

and therefore 

Keif) > sup inf /(fc) = liminf/(«). 



A limit inferior of a sequence of coherent lower previsions is always coherent, see IWallevI ll99lL Corol- 
lary 2.6.7]. 

^^The following simple proof is due to lWallevI Il99ll Section 3.5.7]. Observe that S„{9'f-f) = [f{n) - 
f{0)]/n ^ as « ^ ~ so SgiO'f - f) =Sg [e'f-f) = 0. 
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The second statement is an immediate consequence of the first, and the third follows 
easily from the definition of E_q and Eg. Finally, the fourth statement follows at once from 
the second. □ 

Example 3 (Not all Banach limits dominate Sq). Consider the event 

A= {n^ + k: n>l,k^O,...,n~l}. 

Then A has 'limiting relative frequency' Sg{A) — 5e(A) = 1/2, whereas E_q{A) = and 
Eg{A) — 1. This shows that S_g strictly dominates E_g, so not all Banach limits dominate 
Sg. 

Indeed, for the limiting relative frequency, consider the subsequence 5„2_|(A), m > 2 
of 5„ (A). Then 

S,„2 |(A) = — - An{0,...,m^-2} = = — ; = ^-^5 — -- = -, 

so this subsequence converges to j. Now the 'integer intervals' [m^ — 1, (m + 1)^ — 1], 
m > 1 cover the set of all natural numbers, and as n varies over such an interval, S„{A) 
starts at 5,„2_i(A) = < increases to 5„,2+,„(A) = = i, and then again 

decreases to 'S(,„^i)2_j(A) = j ^Jt^^ < j- Both the lower and upper bounds converge to j 
as m oo, and therefore the sequence S„{A) converges to j as well. 

To calculate E_g{A), we consider the second equality in Eq. (fTSl l. Fix n > 1 and let 
k^n^ + n, then fc + n - 1 = (n + 1)^ - 2, so 

1 k+n-l 1 («+l)^-2 

whence infi.>o ^ /^(^) ~ for all n > 1, and therefore £9 (A) = 0. To calculate 

Eg{A), fix n> 1 and let k — n^ then 

1 k+n—l 1 I 

whence sup^,>o i L^i^" ' (^) = 1 for all n > 1 , and therefore £9 (A) = 1 . 4 

In an interesting paper. iKadane and O'Hagaii 1 1995 1 study candidates for the 'uniform 



distribution' on N. They consider, among others, all the finitely additive probabilities 
(or equivalently, all coherent previsions) that coincide with the limiting relative frequency 
on all events for which this limit exists. One could also consider as such candidates the 
coherent previsions that dominate the sampling lower prevision S_g, which have the benefit 
of being strongly shift-invariant. But, we actually believe that all Banach limits (or actually, 
their lower envelope) are good candi dates for being called ' uniform distributions on N' and 



not just the ones that dominate Sg. iKadane and O'Hagan also propose to consider other 



coherent previsions, and their idea is to consider the 'residue sets', which are the subsets 

7;;„ = {km + r: k>0} = {£€N: £ = r mod m} 

of N, where m > 1 and r ~ I, ... ,m — I. These sets are 0'" -invariant, so we already know 
from Proposition |20] that E»( Rl) = SgjRl,) =Sg( R':„) = Eg{R^„) = ^ for all m > 1 and 
r=l,...,m— 1. Now what iKadane and O'HaganI do, is consider the set of all coherent 
previsions (finitely additive probabilities in their paper, but that is equivalent) that extend 
the probability assessments P{R^„j) — 1 /m for all events In other words, they consider 
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the natural extension Ej^^ of all such assessments, i.e., the lower envelope of all such 
coherent previsions. It is not difficult to prove that this natural extension is given b}|3 



2 777—1 

£res(/) = lim - ^ inf /(^OT + r). 

This coherent lower prevision is completely monotone [as a point- wise limit of completely 
mono tone lower pre visions, even (natural extensions to gambles of so-called) belief func- 
tions iShafei , I976ll 1. and weakly shift-invariant [since the natural extension of any weakly 
shift-invariant lower prevision is]. Since the assessments = ^ coincide with the val- 

ues given by E_g , we see that E_q will point-wise dominate the natural extension fi^g^ of these 
assessments to all gambles. But as we shall shortly prove in ExampleH] K^^^ is not strongly 
shift-invariant, meaning that among the coherent previsions that extend these assessments, 
there also are coherent previsions that are not Banach limits (not shift-invariant). 

Example 4. Here we show by means of a counterexample that Ej-^^ is not strongly shift- 
invariant. Let Z?„, := {0, . . . ,m — 1} and A := [ji„>i{m} x B,„, and consider the map 

in(in — 1 } 

(j): A^N: {m,r)^ 'l>{m,r) - + r+l. 

It is easy to see that is a bijection (one-to-one and onto). Also define the map 

fc: A ^ N: {m,r) i-^ K{m,r) := Nm(p{m,r) + r. 

for some fixed N >2. We consider the strict order < on A induced by the bijection (p, i.e., 
{in,r) < {m',r') if and only if 0(m,r) < ^{m' ^r') [if and only if m < m, or m = m' and 
r < r' , so < is the lexicographic order]. Then K is an increasing map with respect to this 
order To see this, assume that (m,r) < (m',r'). If m < m' , then 

K{m,r) — Nm(j){m,r) + r < Nm(p{m' ,0) + r 

< Nm ^{m ,Q) + Q < Nm ^{m ,r') + r =K{m,r). 

If on the other hand m — m' and r < r', then K{m, r) — Nm(p (m, r) + r < Nm(p (m, r') + r' = 
(c(m,r'). 

Moreover, given (m, r) < {m' ,r'), we see that K{m' y ) — K{m,r) >N. Indeed, since K is 
increasing, it suffices to prove this for consecutive pairs in the order < we have defined on 
A. There are only two possible expressions of consecutive pairs (m,r) and {m' ,r'): either 
we have (m',r') = {m,r+l), and then we get 

K{m,r+ 1) — K{m,r) —Nm[(j){m,r+ 1) - (j){m,r)] + 1 =Nm+ I >N; 

or we have r = m — 1 , {m', r') (m+l, 0), and then we get 

K{m+ 1,0) — K(m,m - 1) =Nm[^{m+ 1,0) -(j){m,m - 1)] +N(j){m+ 1,0) - (;7i - 1) 

= Nm+N<j){m+ 1,0) - {m-l)>Nm>N, 

taking into account that (m + 1 , 0) > w — 1 by definition of . 

Consider the set C = k{AY. Then Ej.g^{C) — lim„,^oo ^ T!"=o ^^^keNlcikm + r). Since 
for every m G N and r G the value K{m, r) — Nmcj) {m,r) +r does not belong to C, we 
deduce that ^ inf/teN {km + r) = for all m, and consequently Ej.^^ (C) — 0. 

On the other hand, Eg{C) — Hm„^ooinf<;>o ^L('^^^^c(^)- Since by construction any 
two elements in k{A) differ in at least elements, we deduce that infj^>o ^ T!(=k~^ ^c(^) > 
1 — and this for all « e N. This implies that E_g{C) > 1 — > 0. Hence, Ej.^^ is 



See lDe Cooman et alj l200dl for a proof. 



46 



GERT DE COOMAN AND ENRIQUE MIRANDA 



Strictly smaller than the smallest strongly shift-invariant natural extension E_q, and there- 
fore not strongly shift-invariant. ♦ 

8.2. Strong r-invariance. Now consider an arbitrary non-empty set Also consider a 
transformation T of ^ and the Abelian monoid ^ — {T" : n > 0} generated by T . We 
shall characterise the strongly T-invariant coherent lower previsions on ^(^) using the 
Banach hmits on (N). 

First of all, consider any coherent lower prevision P on ^{^), and any gamble / on 

. Define the gamble //> on N as 

fp{n):=P{{r)"f)=P{foT"). (16) 

[This is indeed a gamble, as for all n we deduce from the coherence of P that //>(«) — 
P(/or") < sup[/or"] <sup/and similarly /^(n) > inf/.] On the one hand (r7)7(n) = 
P{T'fo T")) = P{T'{foT")) = fjAn) and on the other hand {T'f)p{n) = P(/o F'+i) = 
fpin + l)=fpi9n), so 

{T'f)p=fTP^e'fp, (17) 

and this observation allows us to establish a link between the transformation T on ^ and 
the shi ft transformation 9 on N. This makes us think of the following trick, inspired by 
what Bhaskara Rao and Bhaskara Raol 1 1983 , Section 2.1.3(9)] do for probability charges. 



rather than coherent lower previsions. Let L be any shift-invariant coherent prevision on 
^(N), or in other words, a Banach limit on ^(N). Define the real-valued functional P^ 
on ^(^) by Eiif) '■= L{fp)- We show that this functional has very special properties. 

Proposition 21. Let Lbe a shift-invariant coherent prevision on ^(N), let Pbe a coherent 
lower prevision on ^{^), and let T be a transformation of 3^ . Then the following 
statements hold. 

1. P]^ is a weakly T -invariant coherent lower prevision on ^[^) (with equality). 

2. If P dominates a weakly T -invariant coherent lower prevision Q on .^{J^), then P^^ 
dominates Q. 

3. IfP = P is a coherent prevision, then Pl is a (strongly) T -invariant coherent prevision 
on^{5C). 

4. If Q is a weakly T -invariant coherent lower prevision on J^(^), then the (strongly) 
T -invariant coherent prevision Pl dominates Qfor any P in ^(Q). 

5. IfP — P is a T -invariant coherent prevision, then Pl ~ P. 

Proof. We first prove the first statement. Consider gambles / and g on Since inf/ < 
fp, it follows from the coherence of L that inf / < L(//>) = Piif)- Moreover, we have for 
any « in N that 

{f+8)pin)^P{{f+g)oT")^P{foT"+goT")>P{foT")+P{goT")=fp{n)+gp{n), 

where the inequality follows from the coherence [super-additivity] of P. Since L is coher- 
ent, we see that Pilf + g) > L(fp)+L(gp) = PL{f )+PL{g)- Finally, for any A > 0, we 
have that {Xf)p(n) = p({Xf) o T") = £(A(/o T")) = AP(/o r«) = Xfp(n), since P is 
coherent. Consequently PiiXf) = L{X fp) ~ XL{fp) — XPi{f), since L is coherent. This 
proves that is a coherent lower prevision on J^{3^) [because (P1)-(P3) are satisfied]. 
To show that it is weakly T-invariant, recall that {T' f)p = 9' fp, whence 

p^{rf)^L{(rf)p)^L{9'fp)^L{fp)^p^{f), 

since L is shift-invariant. 
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To prove the second statement, assume that P dominates the weakly T-invariant coher- 
ent lower prevision Q on ^(^). Then for any gamble / on ^ , we see that 

fp{n) =P{foT")> QifoT") > Qif), 

where the last inequality follows from the weak T-invariance of Q. Consequently, since L 
is coherent, we get = L{fp) > Q{f). 

The third statement follows immediately from the first and the fact that Pl is a self- 
conjugate coherent lower prevision (and therefore a coherent prevision) because P and L 
are. 

The fourth statement follows at once from the second and the third. The fifth is an 
immediate consequence of the definition of P^. □ 

We can use the results in this proposition to characterise all strongly T-invariant coher- 
ent lower previsions using Banach limits on ^(N). 

Theorem 22. Let P be a weakly T-invariant coherent lower prevision defined on some 
T-invariant domain J(f, that avoids sure loss. Then the set of all T-invariant coherent 
previsions on J^{^) that dominate P on is given by 

{Pl: Pe^{P) andLePeiN)}, 

so the smallest strongly T-invariant coherent lower prevision E_pj on ^(^) that domi- 
nates P, i.e., the strongly T-invariant natural extension of P, is the lower envelope of this 
set, and also given by 



Epr{f)= inf En(fp)^ inf supinf 



1 k+n-\ 



" l=k 



for any gamble f on ^ . As a consequence, the set Pl(^) of all T-invariant coherent 
previsions on J^{^) is given by 

Pt{^) = {Pl: P eP and L ePe{N)} . 

This tells us that all T-invariant coherent previsions can be constructed using Banach 
limits on ^(N). The smallest strongly T-invariant coherent lower prevision Ej on ^{^) 
is the lower envelope of this set, and also given by 

k+n-\ 



Erif) ~ inf Eg(fp) — inf supinf 

PeV(.r) Pe¥{.sr)„>ik>0 



1 K+n—i 

- 1 piiT'yf) 



t=k 



for any gamble f on ^ . 



Proof. First of all, a coherent prevision P on ^(^) belongs to ^{P), i.e., dominates 
P_ on its domain ,J^, if and only if P dominates the natural extension E_p on all gambles. 
Moreover, E_p is weakly T-invariant by Theorem [8] Now consider any P E (P). Use 
the above observations together with Proposition ISTI [statements 3 and 4] to show that for 
any Banach limit L on ^(N), Pl is a T-invariant coherent prevision that dominates P. 
Conversely, if P is a T-invariant coherent prevision on ^{^) that dominates P on J(f, 
then by Proposition|2T| [statement 5], P = Pl for any Banach limit L on ^{N). This shows 
that {Pl: P G ^{P),L G P0(N)} is indeed the set of T-invariant coherent previsions on 
that dominate P on Consequently, E_pL is the lower envelope of this set, 
whence for any gamble / on ^ 

EpT-if)^ inf inf Pz,(/) = inf inf L{fp) 

- Pe.^{P)LePe{N} ' Pe.^iP]L€¥g{N) 
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and since E_q is the lower envelope of Pg (N), 



inf Enlfp) 



and using Eqs. ( fTsT i and ( fT6] l. 



inf sup inf 

/'G^(£)„>li>0 



1 k+n-\ 

- L ^'((rO'/) 



The rest of the proof is now immediate. □ 

8.3. Generalised Banach limits. The above results on monoids S/j generated by a single 
transformation T can be generalised towards more general monoids ."^ of transformations 
of ^ , such that the set of convex mixtures of the lifted linear transformations in 5^' 
is directed by the successor relation ^ on The following discussion establishes an 
interesting connection between strong invariance and the notion of a generalised Banach 
limit. 

We can consider =^5^* as a monoid of transformations of itself, as follows: with any 
element T* we associate a transformation of 5^*, also denoted by T*, such that T* {S*) := 
S*T* e Sr*, for any S* in J^*0 We can, in the usual fashion, lift J* to a transformation 
(r*)' onif(=^*) by letting (r*)'^ = T*, or in other words 

(ryg(5*)=g(r(5*))-g(5*r), (is) 

for any S* in S^* and any gamble g on i.e . , g g £ £{3'*). 

Now a generalised Banach limit ISchechter , I1997'. Sections 12.33-12.38] on ^{^3'*^ is 



defined as any linear functional on ££{^3*) that dominates the limit inferior operator with 
respect to the directed set 3* . Let us take a closer look at this limit inferior operator It is 
defined by 

Hminf p = liminf ?(r*) := sup inf ^(r*), 

for any gamble g on 3*. Now recall that T* ^ S* if and only if there is some R* in 3* 
such that r* = R*S*, so we get, using Eq. ([TSll, that 

liminfg(r*)= sup inf g(R*S*)^ sup inf (S*yg(R*)^ lim P^J(S*yg), 

where £3?* is the vacuous lower prevision on ^{3*}. If we look at Corollary [14] for 
the special case ^ = 3* and the monoid of transformations 3*, recall that we need 
to lift transformations in 3* before we can apply them to gambles, and that the lifted 
transformations of 3* already constitute a convex seQ we easily get to the following 
conclusion. 

Proposition 23. The limit inferior operator on ^(^3*) is actually the point-wise smallest 
strongly 3* -invariant coherent lower prevision on and the generalised Banach 

limits on J£'{3*) are the 3* -invariant coherent previsions on ^{3*). 



Usually, T*{S*) is defined as T*S*, see for instance lWaiie^ |l99l'. Note 1 of Section 3.5.1]. But we have to 
take a different route here because the elements of ,9"* are convex mixtures of lifted transformations, and as we 
have seen, lifting reverses the order of application of transformations. 

^^In general, even if ,T' is directed by the successor relation ^, the limit inferior operator on J^[S') will not 
be strongly invariant. But convexification, or going from ,5" to ,5^*, makes the limit inferior strongly invariant. 
Observe in this respect that the limit inferior operator on ^(N) is not strongly shift-invariant, but its 'convexified' 
counterpart £g is. 
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We can now apply arguments similar to the ones in the previous section, for general 
monoids 5^ of transformations of X such that 3"* is directed. Consider any coherent 
lower prevision P on ^(^) and any gamble /, and define the following gamble //> on 

MS*):^PiS*f) 

for any S* in 3*, which generalises Eq. ( fTSI l. Observe that, using Eq. ( fTsT l. 

{T*f)p{S*)^PiS*T*f)^fpiS*T*) = iT*yfp{S*), 

so 

iT*f)p = {T*Yfp, 

which generalises Eq. ( fTTI i. If we consider any ^*-invariant coherent prevision L on 
^(■5^*), or in other words a generalised Banach limit on ^{3*), we can now define 
a new lower prevision on ^(^) by Pi{f) '.— L{fp), and Proposition 1211 as well as 
Theorem|22l can now easily be generalised from monoids of transformations with a single 
generator to arbitrary directed monoids. In particular, we find that 

Ep^(f)= inf liminfPfr*/) and£^(/) = inf \immfP(T*f) 

for any gamble / on where P is any weakly J^'-invariant lower prevision that avoids 
sure loss. 



9. Permutation invariance on finite spaces 

Assume now that ^ is a finite group ^ of permutations of Then we have the 
following characterisation result for the weakly ^-invariant coherent lower previsions. 

Theorem 24. Let ^ be a finite group of permutations of 3^ . All weakly -invariant 
coherent lower previsions Q on ^{ ^) have the form 

Q=T^^L^P^ (19) 

1^1 7ie.^ 

where is the number of permutations in 3^, and P is any coherent lower prevision on 

if(jr). 

Proof. Consider a coherent lower prevision P on ^(^), and let Q be the corresponding 
lower prevision, given by Eq. (fT9] l. Then Q is coherent, as a convex mixture of coherent 
lower previsions nP. Moreover, let CJ be any element of then 

where (D^ ^ {(Dn : n G ^} = 3^ , because ^ is a group of permutations. Consequently 
U5Q = 2^ so 2 is weakly J^-invariant. 

Conversely, let Q be any weakly ^-invariant coherent lower prevision, then we recover 
Q on the left-hand side if we insert Q in the right-hand side of Eq. ( fT9l ). So any weakly 
,^3^-invariant coherent lower prevision is indeed of the form ( fT9] ). □ 
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Next, we give an interesting representation result for the strongly iJ^-invariant coherent 
lower previsions, when in addition, ^ is a finite set0 As we shall see further on, this es- 
sentially simple result has many interesting conse quences, amongst which a generalisation 



to coherent lower previsions of .de Finettij ' s 1 1937 1 representation result for finite sequences 



of exchangeable random variables (see Section |9^ . Recall that is the set of all f^- 
invariant atoms of X . For each A in s^;^, define P"(-|A) as the coherent prevision on 
^(^) all of whose probabihty mass is uniformly distributed over A, i.e., for all gambles 
/on JT: 

Finally, let P"{f\s^i^) denote the gamble on that assumes the value P^(^f\s^i^)[A) :— 
P"{f\A) in any element A of 

Theorem 25. Let ^ be a group of permutations of the finite set . A coherent lower 
prevision on ^(^) is strongly ^-invariant if and only if P{f) = EoiP" {f\-^£i>')) for all 
f in ^(^), where Pq is an arbitrary coherent lower prevision on ^{s/a^). 

Proof. We begin with the 'if part. Let P_q be an arbitrary coherent lower prevision on 
.if (^^), and suppose that P = Pj^{P''{-\s!^ayi)). Then it is easy to see that P is coherent. 
We show that P_ is strongly ^^-invariant. Consider any gamble / on and any n £ 0^. 
Then for any Am s^ijn and any gamble / on 

P"{f~n'f\A) V [f{x)-f{Kx)] = 0, 

because x G A is equivalent to nx £ A. So we see that P_{f — n' f) = Pq{0) = 0, since Pq 
is coherent. In a similar way, we can prove that P{7t'f — f ) —0, so P is indeed strongly 
^-invariant. 

To prove the 'only if part, we first concentrate on the case of a ^-invariant coherent 
prevision P on ^(^). Fix any gamble / on Since P is a coherent prevision, we find 
that 

£ //^ and P{f)= ^ P(//a)= E Pif\A)P{A), 

where we have used Bayes's rule to define P(/|A) := P{flA)/P{A) if P(A) > and P(/|A) 
is arbitrary otherwise. 

Now assume that P is ^-invariant. Fix any i^-invariant atom A in s^c^ such that 
P{A) > and let n £ ^. For any gamble / on JT', we see that tt'C/Za) = {n' f)lA, since A 
is in particular TT-invariant. Consequently 

P{n'f\A)^P{[n'f)lA)/P{A)^P{n'[flA))lP[A)=P{fh)/P{A)=P{f\A), 

so P(-|A) is ^^-invariant as wellFl Now let for any y in the finite set A, p{y\A) :~ 
P{{y}\A) > 0, then on the one hand T,xeAPix\A) = ^"(^1^) = 1- On the other hand, it 
follows from the TT-invariance of P(-|A) that p{x\A) — p{nx\A) for any x in A. Since we 
know from Proposition [T] that A = {nx: n G ^}, we see that p{-\A) is constant on A, 



^*'We find the 'permutation symmetry" between Theorems [24] and [25] quite surprising: the former states that 
a weakly .55^-invariant coherent lower prevision is a uniform prevision (or mixture) of coherent lower previsions, 
and the latter that a strongly ,S^-invariant coherent lower prevision is a coherent lower prevision of uniform 
previsions. 

This is an instance of a more general result, namely that coherent conditioning of a coherent lower prevision 
on an invaiiant event preserves both weak and strong invariance. A proof of this statement is not difficult, but 
outside the scope of this paper. 
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SO p{x\A) = 1/\A\ for all x in A, and consequently P{f\A) = P"(/|A), whence P{f) = 
Lagj<>> P''{f\^)P{^)- So indeed there is a coherent prevision Pq on ^{^^s), defined by 
Pq{{A})=P{A) for allA e such that P = Po(f'"(-Kf?'))- 

Finally, let P be any strongly ^-invariant coherent lower prevision, so any P E ^(P) 
is ,£J'-invariant and can therefore be written as P = Po{P" {■\£/.ip>)). If we let Pq be the 
(coherent) lower envelope of the set {Pq : P £ ^{P}}, then since P is the lower envelope 
of ^ (£) , we get immediately that P = f q ( • ) ) • ^ 

As an immediate corollary, we see that that the uniform coherent prevision P" on 
^{^) is the only strongly ^-invariant coherent lower prevision on ^{J^) if and only 
if is the only J^-invariant atom, i.e., if = {^}. This is for instance the case if ^ 
is the group of all permutations of or more generally if ^ includes the cyclic group 
of permutations of ,9^ . It should therefore come as no surprise that, since symmetry of 
beliefs is so often confused with beliefs of symmetry, the uniform distribution is so often 
(but wrongly so) considered to be a good model for complete ignorance. 

Another immediate corollary of this result is that the smallest strongly ^-invariant 
coherent lower prevision on ^{^) is given by P{f) = infyig^^ ^ILxeAfi^)' which of 
course agrees with the uniform distribution when we let ^ be the group of all permuta- 
tions. 

These results do not extend to the case where we have transformations of ^ that are 
not permutations; as we have said before, as soon as we have two different constant trans- 
formations in the monoid there are no strongly invariant beUef models. 



9.1. A few simple examples. We now apply the theorems above in a number of interesting 
and simple examples. 

Example 5. Let ^ = 3^2 '■= {1)2}, then all coherent lower previsions on ^(^) are so- 
called linear-vacuous mixtures, i.e., convex combinations of a coherent (hnear) prevision 
and the vacuous lower prevision, and therefore given by 

P(/) = e [a/(l) + (1 - a)/(2)] + (1 - e)min{/(l),/(2)}, 

where < a < 1 and < e < 1. Let ^2 be the set of all permutations of Then the 
only strongly ^2-invariant coherent lower prevision is the uniform coherent prevision 

corresponding to a = 5 and e = 1. The weakly ,^2-invariant coherent lower previsions 
are given by 

P{f) = e^i (/) + (!- e)min{/(I),/(2)}, 

where < e < I, so they are all the convex mixtures of the uniform coherent prevision and 
the vacuous lower prevision. ♦ 
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Example 6. Let ^ — ^3 := {1,2,3}, then all 2-monotone coherent lower previsions on 
if (^3) are given bjH 

P(/)=mi/(l)+m2/(2)+m3/(3) 

+ m4min{/(l),/(2)}+m5min{/(2),/(3)}+m6min{/(3),/(l)} 

+ m8min{/(l),/(2),/(3)}. 

where < < 1 and Y,l=i '^k = 1. Let ^3 be the set of all permutations of Then the 
only strongly ^3-invariant coherent lower prevision is the uniform coherent prevision 

^(/) = ^[/(l)+/(2)+/(3)], 

corresponding to mj = 1112 = = ^ and niA, — ms — mg = nij = mg = [Observe that 
a coherent prevision is always 2-monotone.]. Weak ^j- invariance, on the other hand, 
requires only that mi = m2 = mj, and m4 — — me, so all the weakly ^3-invariant and 
2-monotone coherent lower previsions are given by 

a/) = ^[/(i)+/(2)+/(3)] 

+ ^[min{/(l),/(2)}+min{/(2),/(3)}+min{/(3),/(l)}] 
+ M3min{M±M,M±M,M±Z(l)| 

+ M4min{/(l),/(2),/(3)}. 

where < Mj. < 1 and Mi +M2 +M3 +M4 = 1. The weakly ^3 -invariant and completely 
monotone coherent lower previsions (natural extensions of belief functions) correspond to 
the choice M3 = 0. ♦ 

Example 7. Consider rolling a die for which there is evidence of symmetry between 
all even numbers, on the one hand, and between all odd numbers on the other Let 
^ = ^ := {1, ... ,6} and let be the set of all permutations of ^ that map even 
numbers to even numbers and odd numbers to odd numbers. The i3^eo-invariant atoms are 
{1,3,5} and {2,4,6}. By Theorem |25l the strongly ^eo-invariant coherent previsions on 
.jSf (<^), which are the precise belief models that are compatible with the subject's beliefs 
of symmetry, are given by 

f (/) = f [/(I) +/(3) +/(5)] + ^[/(2) +/(4) +/(6)], 



An explicit proof of this statement is beyond the scope of this paper, but it runs along the following lines: 
(i) any coherent lower probability on the set of all events of a three-element space is 2-monotone | Walley, 1981], 
p. 58]; (ii) all 2-monotone coherent low er probabilities make up a convex set, and are convex mixtures of the 
extreme points of this set iMaad . [2003( . Chapter 2] (By the way, an argument similar to that in ,Maal3l l2003l 
Chapter 2] shows that all strongly ^'-invariant coherent lower previsions are (infinite) convex mixtures of the 
extreme strongly ^'-invariant coherent lower previsions.); (iii) the 2-monotone coherent lowe r previsions on 
all gambles are natural extensi ons of the 2-monotone coherent lower previsions on all events IWallevl . ll98lL 
iDe Cooman etan.l2006ll2005tll3l ; and (iv) natural extension to gambles of 2-monotone lower probabilities pre- 
serves convex mixtures. 
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where < a < 1 , and more generally, the strongly ,^^eo-invariant coherent lower previsions 
on ^(^) are [apply Theorem|25]and use the results in Example|5l 



forO < e < 1 andO < a < 1. ♦ 

Example 8. Let us show that the point-wise smallest strongly invariant coherent lower 
prevision extension is not necessarily 2-monotone. Consider ^ := {1,2,3,4}, and let n 
be the permutation of ^ defined by 7r(l) — 2, n{2) — 1, 7r(3) = 4 and n{4) ~ 3. Observe 
that n is its own inverse, so ^ = {id ff^, tt} is a group. From Theorem|25]we infer that the 
point-wise smallest strongly TT-invariant coherent lower prevision on all gambles is given 



Let us now consider the gambles f\ and /2 on givenby /i (1) = 0, /i (2) = — 1, /i (3) = 
l,/i(4) = -1 and/2(l) = -l,/2(2) = -0.25,/2(3) = -1.5, /2(4) 0. Checkthat 

KAfi A/2) V/2) - -1.25-0.125 = -1.375 < -0.5 -0.75 = +£^(/2). 

Hence, £^ is not 2-monotone. ♦ 

The following example shows that possibility measures are not very useful for mod- 
elling permutation invariance. 

Example 9. Consider a possibility measure n defined on all events of a finite space 
Then there is a map A : <^ ^ called the possibility distribution of H, such that 
X{x) :— n({x}) and moreover Tl{A) = maXx^AX{x) for all non-empty events A C 
We have mentioned before that H is a coherent upper probability if and only if n( = 
maXve.a;' = 1. We shall assume this is the case. Now consider any group ^ of per- 
mutations of ,S^. Then clearly n is weakly ^^-invariant if and only if A is constant on the 
.^J^-invariant atoms of In particular, n is weakly invariant with respect to all permuta- 
tions if and only is X is everywhere equal to one, so n is the vacuous upper probability. 

For strong ^-invariance, let P be any strongly ^-invariant coherent lower prevision 
whose domain contains at least all events. Let x be any element of J^, and let [x]^ be 
the ,^3^-invariant atom that contains x. Then it follows from Theorem |25] that P{{x}) < 
l/\[x]^\. So for P to extend a possibility measure, it is necessary (but not sufficient) that 
there is at least one element z of ^ such that P{{z}) ~ 1, implying that z should be left 
invariant by all the permutations in or equivalently, [z] ^ — {z}. ♦ 

9.2. Exchangeable lower previsions. As another example, we now discuss the case of 
so-called exchangeable coherent lower previsions. Consider a non-empty finite set ■= 
{1, . . . , Jf} of categories, and random variables Xi, . . . , Xn taking values in the same set 
where K and are natural numbers with K>2 and N >l. The joint random variable 
X :— [Xi , . . . ,X]\i) assumes values in the set := JT^^H We want to model a subject's 
beliefs about the value that X assumes in and generally, we use a coherent lower 
prevision P on to represent such beliefs. 




} 



by 




This means that we assume these N random variables to be logically independent. 



54 



GERT DE COOMAN AND ENRIQUE MIRANDA 



Now assume that our subject believes that all random variables are generated by the 
same process at different times k, and that the properties of this process do not depend 
on the time k. So, the subject assesses that there is permutation symmetry between the 
different times k. How can such beliefs of symmetry be modelled? 

With a permutation ;r of { 1 , . . . , A^}, we can associate (by the usual procedure of lifting) 
a permutation of = also denoted by tt, that maps any x = [x\,...,xn) in to 
TTx := (xjf(i), . . . ,JC;[(Af)). The belief models that are compatible with the subject's beliefs 
of symmetry, are therefore the coherent lower previsions on (subsets of) ^{,9^^) that are 
strongly ^ y-invari a nt, wh ere ,0^^ is the group of liftings to of all permutations of 

11 1wallevl|l 19911 Chapter 9] calls such lower previsions exchangeable, as they 

generalise Ide Finetti 's 1 1937 1 notion of exchangeable coherent previsions. We intend to 
characterise the exchangea ble lower p re vision s using Theorem |25] This will lead us to a 
generalisation (Eq. (|20] |) of Ide Finettir s lll937l1 representation result for finite numbers of 
exchangeable random variables. 

It should be mentioned here that we should, as always, clearly distinguish between 
'beliefs of symmetry' and 'symmetry of beliefs' . The latter imposes much weaker require- 
ments on cohere nt lower previs ions, namel y th ose of weak ,^3^^-invariance, which is called 
permutability by lWalleyl lll99l[ Chapter 9]H In particular, the permutation symmetry that 
goes along with ignorance can only be invoked to justify permutability, but not, of course, 
exchangeability. Observe in this respect that the vacuous lower prevision on I IS 

permutable, but not exchangeable. It is well-known (see for instance lZabell 1 1989a , 1992']), 
that Laplace's Rule of Succession can be obtained by updating a particular exchangeable 
coherent prevision, but it should be clear from the discussion in this paper that ignorance 
alone (the Principle of Insufficient Reason) cannot be invoked to justify using such an ex- 
changeable prevision, as (with cons i derable hindsight) La place implicitly seems to have 
done (see for instance |Howii i2002ll . lZabeiil il989atll992ll ). 

For any x = (xi , . . . ,xn) in the ,^3^^-invariant atom [x]^n is the set of all permuta- 
tions of (the components of) x. If we define the set of possible count vectors 



= I (mi , ... , m^) : m^ £ and £^ m^. = A^| 

and the counting map T: ,9^^ ,yV^ such that T(xi, . . . ,xn) is the fc-tuple, whose ^-th 
component is given by 

Tk{x]_,...,XN) = \{ee xe^k}], 

i.e., the number of components of x whose value is k, then the number of elements of the 
invariant atom [x] is precisely 



v(T(x)) 



A^ 

ri(x)...r^(x) 



Nl 



Ti{x)\...T^(x)l 



,N and . An invariant atom is 



and T is a bijection (one-to-one and onto) between ■ ''^^ 
therefore completely identified by the count vector T(x) of any of its elements x, and we 
shall henceforth denote the invariant atoms of by [m], where m=(mi,...,mK:) G , 
and X e [m] if and only if T(x) = m. 



^"See lWalle^j Il99ll Chapter 9] for a much more detailed discussion of the difference between permutability 
and exchangeability. 
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The coherent prevision P"{-\m) on ^( JT^) whose probability mass is uniformly dis- 
tributed over the invariant atom [m] is given by 

^ ' xG[mJ 

Interestingly, this is the precise prevision that is associated with taking a-select drawings 
without replacement from an urn with balls, mi of which are of type 1, . . . , and iiik 
of which are of type K. Theorem |25] now tells us that any exchangeable coherent lower 
prevision P on J!f{^^) can be written as 

P{f)=E!'AP"{f\^.'')): (20) 

where is some coherent lower prevision on This means that such an ex- 

changeable lower prevision can be associated with N a-select drawings from an urn with 
N balls of types 1, . . . , K, whose composition m is unknown, but for which the available 
information about the unknown composition is modelled by a coherent lower prevision P^. 

That exchangeable coherent previsions can be interpreted in terms of sampling without 
replacement from an urn with unkn o wn composition, is a ctually well-known, and essen- 



tially goes back to de Finetti 1 1937 1. Heath and Sudderth j 1976.1 give a simple proof for 



random variables that may assume two values. But we believe our proo0 for the more 
general case of exchangeable coherent lower previsions and random variables that may 
assume more than two values, is conceptually even simpler than Heath and Sudderth's 
proof, even though it is a special case of a much more general representation result (The- 
orem |25]l. The essence of the p resent p r oof in the special case of coherent previsions P 



is captured wonderfully well bv lZabellT s 1119921 Section 3.1] succinct statement: "Thus P 



is exchangeable if and only if two sequences having the same frequency vector have the 
same probability." 

Our subject's beliefs could, in addition, be symmetrical in the categories in — 
{1, . . . , k}, for instance as a result of her ignorance about the process that generates the 
outcomes X/^ at each time k. As we have seen, this will be typically represented by using 
a type of weakly invariant belief models, in this case with respect to permutations of the 
categories, rather than the times. Any permutation of induces a permutation of 
also denoted by dJ, through 

(Dx — (D{xi, . . . ,xn) := (CT(xi), . . . ,(D{xn)). 

What happens if we require that P, in addition to being exchangeable, should also be 
weakly invariant under all such permutations? It is not difficult to prove that 

P"{CJ-\f\m)=P"{f\(Dm), 

where we let (Dm = Cl(mi , . . . ^m^) '■— (»Ji57(i)7 ■ ■ ■ - f^miK)) the usual fashion. This im- 
plies that there is such weak invariance if and only if the coherent lower prevision P^ on 
^{.yVj^} is weakly invariant with respect to all category permutations! In particular, this 
weak invariance is satisfied for the vacuous lower prevision on ^{^j^). Another type 
of lower coherent prevision that exhibits such a combination of strong invariance for time 
permutations and weak invariance for category permutations, and which also has other 
very special and interesting properties, is constructed by taking lower envelopes of specific 
sets of Dirichlet-Multinomial di stributions, leading to the s o-called Imprecise Dirichlet- 



Multinomial Model (IDMM, see lWalley and Bernard! 0199911 ^ 



Chapter 9] also mentions this result for exchangeable coherent lower previsions. The essence 
of his argument is similar to what we do in the last paragraph of the proof of Theorem l25 1 
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In the literature, however, it is sometimes required that a coherent precise prevision 
should be invariant with respect to the combined action of the permutation s of times and 



categories. These are the so-called partition exchangeable previsions (see IZabelll 111992 1 
for an interesting discussion and historical overview). Of course, the generalisation of 
this notion to coherent lower previsions should be strongly invariant with respect to such 
combined permutations, and therefore be a lower envelope of partition exchangeable pre- 
visions. For such partition exchangeable lower previsions, Theorem|25]can be invoked to 
prove a representation result that is similar to that for coherent lower previsions that are 
only exchangeable. It should be clear that they correspond to exchangeable lower previ- 
sions for which the corresponding coherent lower prevision on J^{,yy^) is strongly 
rather than just weakly invariant with respect to all category permutations. Of course, any 
justification for such models should be based on beliefs that there is permutation symmetry 
in the categories behind the process that generates the outcomes Xj^ at different times k, and 
cannot be justified by mere ignorance about this process. 

9.3. Updating exchangeable lower previsions: predictive inference. Finally, let us dis- 
cuss possible applications of the discussion in this paper to predictive inference. As- 
sume that we have n* random variables X\, ...X,,*, that may assume values in the set 
— {1, . . . , k}. We assume that these random variables are assessed to be exchange- 
able, in the sense that any coherent lower prevision that describes the available information 
about the values that the joint random variable X* — {Xi, . . . ,X„* ) assumes in should 
be exchangeable, i.e., strongly -invariant. This requirement could be called pre-data 
exchangeability. So we know from the previous section that such a coherent lower previ- 
sion must be of the form P = {P"{'\^k ))' where P"^ is some coherent lower prevision 
on .5f{j\^^ ). We shall assume that P'^ is a lower envelope of a set of coherent previsions 
on^(^/). 

Suppose we now observe the values x = (xi, . . . ,x„) of the first n random variables 
X = {Xi, ...,Xn), where 1 < n < n*. We ask ourselves how we should coherently update 
the belief model P to a new model P(-|x) which describes our beliefs about the values 
of the remaining random variables X' = (^n+i, . . . ,X„*). This is, generally speaking, the 
problem of predictive inference. In order to make things as easy as possible, we shall 
assume that P({x}) > 0, so our subject has some reason, prior to observing x, to believe 
that this observation will actually occur, because she is willing to bet on its occurrence at 
non-trivial odds. 

Let us denote by n' = n* — n the number of remaining random variables, then we know 
that X' assumes values in , and P(-|x) will be a lower prevision on ^^{^^ ). 

We shall first look at the problem of updating the coherent prevision P = Q{P" {'I^k )) 
for any coherent prevision Q in . So consider any gamble g on . It follows from 
coherence requirements (Bayes's rule) that the updated coherent prevision P(-|x) is given 
by 

where 4 (x* ) = 1 if the first n components of the vector x* G are given by the vector x, 
and zero otherwise. Observe, by the way, that by assumption, P(/x) > P(^x) = P({x}) > 0. 
Now for any m* in we find that, with obvious notations, 

'-<«''i-"*'^^„, E /C'l^^^w^'"'*'"''-"" '''' 

^ ' T'(x')+m=m* ^ ' 
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where we let m = T(x),and where T' maps samples x' in S"^'' to their corresponding count 
vectors T'(x') in JV^ . Of course v(in* — m) is non-zero only if m* > m, or equivalently if 
m* — m g jV^ , or in other words if it is possible to select n balls of composition m without 
replacement from an urn with composition m*. In this expression, P"{-\in') stands for the 
coherent prevision on ^( jr^" ) whose probability mass is uniformly distributed over the 
-invariant atom [m'], for any m' in ./(^" . Now for g = I we find that 

^"(^x|m*) = ^^^^^^ = P(m|m*) =: L™(m*) (23) 

is the probability of observing a sample of size n with composition m by sampling without 
replacement from an urn with composition m*. is the corresponding likelihood function 
on c/f^" . We may as well consider as a likelihood function on ,y(^" , and for any m' in 
.yy^' we let 

L„ m' Lm m + m' = , \ 

v(m + m') 

be the probability that there remain n' balls of composition m' after drawing (without 
replacement) n balls of composition m from an urn with n* balls. We may then rewrite 
Eq. ( I2TI ). using Eqs. (22\ and ( |23] l. as 

Pig\-) - ^^"""ejiy^"'^^ - e(P"(g|^/)|m), (24) 

where 2(Lm) =P(Ix) > Oby assumption, and 2(-|in) is the coherent prevision on ^(o/f^"') 
defined by 

for any gamble /i on o/f^" , i.e., 2( |m) is the coherent prevision obtained after using Bayes's 
rule to update Q with the likelihood function L„,. This means that if Q is a belief model 
for the unknown composition of an urn with n* balls, then 2(-|in) is the corresponding 
model for the unknown composition of the remaining n' balls in the urn, after n balls with 
composition m have been taken from it. 

Now if we have a coherent lower prevision on ) that is a lower envelope 

of a set of coherent previsions Q, then coherenc^ tells us that the updated lower 
prevision £( |x) is precisely the lower envelope of the corresponding updated coherent 
previsions P{-\x), and consequently, using Eqs. ( l24l l and ( l25T l. we find that 

P{g\x)=Pi{P^'{g\^,"')\m), (26) 

where P"^ (-Im) is the coherent lower prevision on ^{JK^ ) given by 

PUh\m) inf { : g G | = inf {e(/;|m) : Q e } , (27) 

for any gamble h on . In other words, P^*(-|in) is the coherent lower prevision ob- 
tained after using coherence (the so-called Generalised Bayes Rule) to update P'^ with 
the likelihood function La,. This means again that ifP'l; is a belief model for the unknown 
composition of an urn with n* balls, then (• |ni) is the corresponding belief model for the 
unknown composition of the remaining n' balls in the urn, after n balls with composition 
m have been taken from it. 



''^This follows from lWallevl 's ll99lL Section 6.5] Generalised Bayes Rule. 
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If we compare Eq. (|26] | with Eq. ( l20l i. we see that the updated behef model £(-|x) is 
still strongly -invariant^ so there still is post-data exchangeability for the remaining 
random variables X' = (X„+i, . . . ,X„*). Moreover, by looking at Eq. (l2Tl l and Eqs. ( |26] | 
and ( |27] |. we see that the updated (lower) previsions P{-\s.) and £(-|x) only depend on 
the observed sample x through the likelihood function Lx(x)- This tells us that this type 
of predictive inference satisfies the so-called likelihood principle, and moreover that the 
count vector m = T(x), or more generally the map T is a sufficient statistic. 

10. Conclusions 

We have tried to argue that there is a clear distinction between the symmetry of belief 
models, and models of beliefs of symmetry, and that both notions ca n be distinguished 
between when indecision is taken seriously, as is the case in IWallevl 's lfl99lll behavioural 



theory of imprecise probabilities. Our present attempt to distinguish between these no- 
tions, and capture the distinction in a formal way, is inspired by Walley's [1991, Chapter 9] 
discussion of the dif fe rence between permutable and exchangeable lower previsions, and 



Pericchi and Wallevr s 11199111 discussion of 'classes of reasonable priors' versus 'reason- 
able classes of priors'. 

Indeed, there seems to be a difference of type between the two notions. The former 
(symmetry of models) is a property that belief models may have, and we may require, as 
a principle of rationality, or as a principle of 'faithful modelling', that if the available evi- 
dence is symmetrical, then our corresponding belief models should be symmetrical too. A 
case in point is that of complete ignorance, where the 'evidence' is completely symmetri- 
cal, and we may therefore require that corresponding belief model should be completely 
symmetrical too. This leads to the various principles discussed in Section |5] all of which 
seem to sing le out th e vacuo us belief model for representing complete ignorance, and 



which extend I Wallew s 11199 IL Section 5.5] treatment of this matter. 

The latter notion (models of symmetry) is more properly related to a type of structural 
assessment: if a subject believes there is symmetry, how should she model that, and how 
should assessments of symmetry be combined with other assessments? We have tried 
to answer such questions in Sections |7] where we discuss the strongly invariant natural 
extension. 

It is well-known that if we only use Bayesian, or precise, probability models, requiring 
invariance of the probability measures with respect to all types of symmetry in t he evi 



dence m ay be impossible; examples were given by Boole, Bertrand and Fisher (see lZabeU 



1 1989all For discussion and references). This has led certain researchers to abandon re 



quiring the above-mentioned 'faithfulness' of belief models, or to single out certain types 
of symmetry which are deemed to be better than others. We have tried to argue that this 
is unnecessary: the vacuous belief model has no such problems, and is symmetrical with 
respect to any transformation you care to name. And of course, our criticism of the Prin - 



ciple o f Insufficient Reason is not new. Our ideas were heavily influenced by I Wallevr s 



1 199111 book on imprecise probabilities, whose Chapter 5 contains a won derful overview 



of arguments against restricting ourselves to precise probability models. IZabelll lll989bll 



also gives an excellent discus sion o f much older criticism, dating back to the middle of 



the 19th century. In particular. l EUisl 's 11 184411 ex nihilo nihil — you cannot make decisions 



or inferences based on complete ignorance — finds a nice confirmation in the fact that 
the vacuous belief model captures complete indecision , and that updating a vacuous belief 



model leads to a vacuous belief model llWalleyL ll99lL Section 6.6.1]. But what we have 



^^See also footnote BT] 
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tried to do here is provide a framework and mathematical apparatus that allows us to better 
understand and discuss the problems underlying the Principle of Insufficient Reason, and 
more general problems of dealing with any type of symmetry in belief models. 

This study of symmetry in relation to belief models is far from being complete how- 
ever, and our notions of weak and strong invariance may have to be refined, and perhaps 
even modified, as well as complemented by other notions of symmetry. It might for in- 
stance be of interest to study the notion of symmetry that captures the insufficient reason 
to strictly prefer that is briefly touched upon near the end of Section 14.11 Also, we may 
seem more certain than we actually are about the appropriateness (in terms of having a 
sound behavioural justification and interpretation) of our notions of weak and (especially) 
strong invariance for random variables that may assume an infinite number of values. This 
is the point where our intuition deserts us, and where a number of interesting questions 
and problems leave us speechless. To name but one such problem, brought to the fore 
by the discussion in Section |2l for certain types of monoids, it is completely irrational 
to impose strong invariance (because doing so makes us subject to a sure loss). We can 
understand why this is the case for the monoid of all transformations, even on a finite set 
(Theorem|6|. But why, for instance, are there no (strongly) permutation invariant coher- 
ent (lower) previsions on the set of natural (and a fortiori real) numbers? Why are we 
(consequently) reduced to using (strong) shift or translation invariance of coherent (lower) 
previsions when we want to try and capture the idea of a uniform distribution on the set 
of natural (or real) numbers? And even then, why, as is hinted at in footnote|50l are there 
situations where updating a (strongly) shift-invariant coherent (lower) prevision produces 
a sure loss? Are there appropriately weakened versions of our strong invariance condition 
that avoid these problems? 
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